OM nucleic - nucleic search, using sw model



Run on:         December 13, 2003, 04:37:24 ; Search time 1684.26 Seconds
                                              (without alignments)
                                              7460.491 Million cell updates/sec

Title:          US-09-852-261-1
Perfect score:  517
Sequence:       1 ggaccggagacgctctgcgg tgaaatacacaagtaaacat 517

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       22781392 seqs, 12152238056 residues

Total number of hits satisfying chosen parameters:      45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
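The header figures can be cross-checked against one another. The sketch below is an illustration only: it assumes (the report does not say so) that one cell update is one query residue compared with one database residue, and that both strands of every database sequence are searched, which the hit total being exactly twice the sequence count suggests. The function name is hypothetical.

```python
# Header values copied from the report.
QUERY_LENGTH = 517              # the query sequence runs from 1 to 517
DB_RESIDUES = 12_152_238_056
DB_SEQS = 22_781_392
SEARCH_SECONDS = 1684.26
STRANDS = 2                     # assumption: forward and complementary strand searched


def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=STRANDS):
    """Dynamic-programming cells filled per second, in millions."""
    return query_len * db_residues * strands / seconds / 1e6


rate = million_cell_updates_per_sec(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
candidate_hits = DB_SEQS * STRANDS

print(f"{rate:.1f} million cell updates/sec")
print(candidate_hits)
```

Under these assumptions the computed rate agrees with the reported 7460.491 to within the rounding of the search time, and the candidate count equals the reported total of 45562784.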



Database :      EST:*
                1:  em_estba:*
                2:  em_esthum:*
                3:  em_estin:*
                4:  em_estmu:*
                5:  em_estov:*
                6:  em_estpl:*
                7:  em_estro:*
                8:  em_htc:*
                9:  gb_est1:*
                10:  gb_est2:*
                11:  gb_htc:*
                12:  gb_est3:*
                13:  gb_est4:*
                14:  gb_est5:*
                15:  em_estfun:*
                16:  em_estom:*
                17:  em_gss_hum:*
                18:  em_gss_inv:*
                19:  em_gss_pln:*
                20:  em_gss_vrt:*
                21:  em_gss_fun:*
                22:  em_gss_mam:*
                23:  em_gss_mus:*
                24:  em_gss_pro:*
                25:  em_gss_rod:*
                26:  em_gss_phg:*
                27:  em_gss_vrl:*
                28:  gb_gss1:*
                29:  gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

                                   SUMMARIES

                   %
Result            Query
   No.    Score   Match  Length  DB  ID         Description

     1    344.2    66.6     796  14  CB959991   CB959991 AGENCOURT_
 c   2    331.6    64.1     558   9  AI503976   AI503976 vm43d08.x
 c   3    330.6    63.9     673  12  BM984670   BM984670 UI-CF-EC1
 c   4    329.8    63.8     623   9  AW146128   AW146128 um37e10.x
 c   5    326.6    63.2     575   9  AI248089   AI248089 qh69f05.x
 c   6    316.6    61.2     549   9  AI169253   AI169253 EST215088
 c   7    315.8    61.1     558   9  AI265629   AI265629 uj04b07.x
 c   8    314.8    60.9     498   9  AA542914   AA542914 ni98c10.s
     9    310      60.0     614  14  CD373004   CD373004 [illegible]
    10    309      59.8     816   9  AI119218   AI119218 [illegible]
    11    303.6    58.7     594  10  BF383724   [illegible]
 c  12    299.8    58.0     527   9  AA913900   AA913900 [illegible]
 c  13    289.6    56.0     642   9  AI876493   AI876493 [illegible]
 c  14    287.4    55.6     499   9  AW495481   AW495481 [illegible]
 c  15    276      53.4     468   9  AI169770   AI169770 EST215669
    16    274.4    53.1     882   9  AI604642   AI604642 vm43d08.y
 c  17    268.2    51.9     430   9  AI478804   AI478804 tm52e04.x
 c  18    263.2    50.9     653  13  BQ200567   BQ200567 [illegible]
    19    258.4    50.0     608   9  AL599807   AL599807 [illegible]
 c  20    254.6    49.2     486   9  AA993659   AA993659 [illegible]
 c  21    254.2    49.2     521   9  AW493459   AW493459 [illegible]
    22    254.2    49.2     559  12  BI715603   [illegible]
    23    254.2    49.2     602  13  BU590710   BU590710 [illegible]
    24    254.2    49.2     621  12  BI221656   BI221656 [illegible]
    25    254.2    49.2    1658  11  AK081019   AK081019 Mus muscu
    26    254      49.1     356   9  AW297586   AW297586 [illegible]
 c  27    253.2    49.0     595   9  AI573421   AI573421 [illegible]
 c  28    252.6    48.9     499  12  BI676839   BI676839 [illegible]
 c  29    252.6    48.9     500   9  AA945553
 c  30    252.6    48.9     525   9  AA963258   AA963258 UI-R-E1-g
    31    251.4    48.6     482   9  AA456717   AA456717 aa13h06.r
 c  32    251      48.5     706   9  AI401719   AI401719 th30b10.x
 c  33    249.4    48.2     525   9  AI599751   AI599751 EST251454
    34    248.6    48.1     665   9  AA690767   AA690767 vu57d12.r
    35    247.8    47.9     559  12  BI715465   BI715465 ic33b09.y
    36    247.4    47.9     799   9  AI314558   AI314558 uj48d07.y
 c  37    247.2    47.8     499  12  BI294072   BI294072 UI-R-DK0-
 c  38    244.2    47.2     502   9  AI104669   AI104669 EST213958
 c  39    243      47.0     561  12  BI714874   BI714874 ic33b09.x
 c  40    240.6    46.5     564  12  BI714981   BI714981 ic34h10.x
    41    239.2    46.3    2170  11  AK038119   AK038119 Mus muscu
    42    237.4    45.9     558  12  BI715475   BI715475 ic33c08.y
 c  43    237.2    45.9     480   9  AA621551   AA621551 af47c10.s
    44    236.8    45.8     512   9  AI876203   AI876203 uj59b10.y
    45    234.2    45.3     949  14  CB589117   CB589117 AGENCOURT



ALIGNMENTS 



RESULT 1 
CB959991 
LOCUS       CB959991     796 bp    mRNA    linear   EST 29-APR-2003
DEFINITION  AGENCOURT_13888044 NIH_MGC_147 Homo sapiens cDNA clone
            IMAGE:30341081 5', mRNA sequence.
ACCESSION   CB959991
VERSION     CB959991.1  GI:30216107
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 796)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Stefan Hansson
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) with help
            and advice from Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: NDAM371  row: p  column: 18
            High quality sequence stop: 707.
FEATURES             Location/Qualifiers
     source          1..796
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:30341081"
                     /tissue_type="Human Placenta"
                     /lab_host="DH10B TonA"
                     /clone_lib="NIH_MGC_147"
                     /note="Organ: placenta; Vector: pBluescriptR; Site_1:
                     all-xhol; Site_2: BamH; Oligo-dT primed using primer
                     5'-TTTTTTTTTTTTTTTTVN-3', size-selected for average
                     insert size 2.3 kb and normalized to ROT 5. This is a
                     primary library enriched for full-length clones and
                     constructed using the Cap-trapper method (Carninci, in
                     preparation). Library constructed by M. Brownstein
                     (NIMH/NHGRI, National Institutes of Health). Note: This is
                     a NIH_MGC library."
BASE COUNT      224 a    197 c    191 g    184 t
ORIGIN



  Query Match             66.6%;  Score 344.2;  DB 14;  Length 796;
  Best Local Similarity   87.3%;  Pred. No. 8.3e-81;
  Matches  455;  Conservative    0;  Mismatches   13;  Indels   53;  Gaps    5;
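The two percentages in each result block appear to follow directly from the other figures. The sketch below is inferred from the numbers in this report rather than taken from the search program's documentation: it assumes Query Match is the score divided by the perfect score, and Best Local Similarity is the number of matches divided by the number of alignment columns (matches, mismatches and indels). Both function names are hypothetical.

```python
PERFECT_SCORE = 517  # from the report header


def query_match_pct(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches, mismatches, indels):
    """Identical columns as a percentage of all alignment columns."""
    return 100.0 * matches / (matches + mismatches + indels)


# Figures for RESULT 1 (CB959991).
qm = round(query_match_pct(344.2), 1)
bls = round(best_local_similarity_pct(455, 13, 53), 1)
print(qm, bls)
```

For RESULT 1 this reproduces the printed 66.6% and 87.3%; the same formulas also reproduce the printed values for RESULT 2 (64.1% and 82.0%).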



[Alignment residues and match lines are illegible in the source; only the row
coordinates and one database row survive. The Db row starting at 420 ends at 434.]

Qy row starts:     1    61   121   181   241   301   361   420   478
Db row starts:   180   240   300   360   420   435   491   551   611

Db        435 -AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 490

RESULT 2 

AI503976/C 

LOCUS       AI503976     558 bp    mRNA    linear   EST 11-MAR-1999
DEFINITION  vm43d08.x1 Stratagene mouse diaphragm (#937303) Mus musculus cDNA
            clone IMAGE:1001007 3' similar to gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AI503976
VERSION     AI503976.1  GI:4401827
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 558)
  AUTHORS   Marra,M., Hillier,L., Kucaba,T., Martin,J., Beck,C., Wylie,T.,
            Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers,Y., Person
            ,B., Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk,R., Ritter
            ,E., Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann,R.,
            Waterston,R. and Wilson,R.
  TITLE     The WashU-NCI Mouse EST Project 1999
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/WashU-NCI Mouse EST Project 1999
            Washington University School of Medicine
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:565223
            This clone was previously sequenced on the 5' end only, this new
            data is from the 3' end
            High quality sequence stop: 440.
FEATURES             Location/Qualifiers
     source          1..558
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:1001007"
                     /tissue_type="diaphragm"
                     /dev_stage="adult"
                     /lab_host="SOLR (kanamycin resistant)"
                     /clone_lib="Stratagene mouse diaphragm (#937303)"
                     /note="Organ: diaphragm; Vector: pBluescript SK-; Site_1:
                     EcoRI; Site_2: Xho I; Cloned unidirectionally from mRNA
                     prepared from diaphragm muscle. Primer: Oligo dT. Average
                     insert size: 1.5 kb. Uni-ZAP XR Vector; -5' adaptor
                     sequence: 5' GAATTCGGCACGAG 3' -3' adaptor sequence: 5'
                     CTCGAGTTTTTTTTTTTTTTTTTT 3'"
BASE COUNT      103 a    133 c    149 g    173 t
ORIGIN



  Query Match             64.1%;  Score 331.6;  DB 9;  Length 558;
  Best Local Similarity   82.0%;  Pred. No. 1.7e-77;
  Matches  433;  Conservative    0;  Mismatches   84;  Indels   11;  Gaps    4;



Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db        530 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 471

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db        470 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCA[...] 411

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db        410 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 351

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
Db        350 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 291

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
Db        290 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 231

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
Db        230 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 171

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
Db        170 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATGACCGCAGGATCCTTTG 111

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 470
Db        110 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 51

Qy        471 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
Db         50 ACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAAGTAAACAT 3
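The "/C" suffix on the hit ID and the descending Db coordinates (530 down to 3) suggest that the query matched the complementary strand of the database entry. Assuming that reading, which the report itself does not spell out, a Db row printed this way can be mapped back onto the stored forward sequence as sketched below. The helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Reverse-complement a plain nucleotide string (A, C, G, T, N)."""
    return seq.translate(COMPLEMENT)[::-1]


def complement_row_to_forward(row_seq, start, end):
    """Convert a Db row printed from `start` down to `end` on the complementary
    strand into (first, last, sequence) on the forward strand of the entry."""
    if start < end:
        raise ValueError("expected descending coordinates for a complement row")
    return end, start, reverse_complement(row_seq)


# Last Db row of RESULT 2 as printed (coordinates 50 down to 3).
row = "ACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAAGTAAACAT"
fwd_first, fwd_last, fwd_seq = complement_row_to_forward(row, 50, 3)
print(fwd_first, fwd_last, fwd_seq)
```

The row holds 48 residues, which matches the coordinate span 3 to 50 on the forward strand.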



RESULT 3 

BM984670/C 

LOCUS       BM984670     673 bp    mRNA    linear   EST 20-FEB-2003
DEFINITION  UI-CF-EC1-abj-k-24-0-UI.s1 UI-CF-EC1 Homo sapiens cDNA clone
            UI-CF-EC1-abj-k-24-0-UI 3', mRNA sequence.
ACCESSION   BM984670
VERSION     BM984670.1  GI:19610417
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 673)
  AUTHORS   Bonaldo,M.F., Lennon,G. and Soares,M.B.
  TITLE     Normalization and subtraction: two approaches to facilitate gene
            discovery
  JOURNAL   Genome Res. 6 (9), 791-806 (1996)
  MEDLINE   97044477
   PUBMED   8889548
COMMENT     Contact: McCray, PB
            McCray Lab
            University of Iowa
            2024 University of Iowa Med Labs, Iowa City, IA 52242, USA
            Tel: 319 356 4866
            Fax: 319 356 7171
            Email: paul-mccray@uiowa.edu
            Tissue Procurement: Dr. M. J. Welsh, University of Iowa
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
            Clone Distribution: Researchers may obtain clones from Research
            Genetics (www.resgen.com) or from Open Biosystems
            (www.openbiosystems.com).
            Seq primer: M13 FORWARD
            POLYA=Yes.
FEATURES             Location/Qualifiers
     source          1..673
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="UI-CF-EC1-abj-k-24-0-UI"
                     /tissue_type="Lung"
                     /dev_stage="Adult and Fetal"
                     /lab_host="DH10B (Life Technologies) (T1 phage resistant)"
                     /clone_lib="UI-CF-EC1"
                     /note="Organ: Lung; Vector: pT7T3-Pac (Pharmacia) with a
                     modified polylinker; Site_1: EcoR I; Site_2: Not I;
                     UI-CF-EC1 is a normalized cDNA library containing the
                     following tissue(s): Normal lung from adult and from fetal
                     day 64, day 87, week 19 and week 42. The library was
                     constructed according to Bonaldo, Lennon and Soares,
                     Genome Research, 6:791-806, 1996. First strand cDNA
                     synthesis was primed with an oligo-dT primer containing a
                     Not I site. Double stranded cDNA was ligated to an EcoR I
                     adaptor, digested with Not I, and cloned directionally
                     into pT7T3-Pac vector. The oligonucleotide used to prime
                     the synthesis of first-strand cDNA contains a library tag
                     sequence that is located between the Not I site and the
                     (dT)18 tail. The sequence tag for this library is
                     AAGTGCTTAC.
                     TAG_LIB=UI-CF-EC1
                     TAG_TISSUE=Normal Lung Epithelial Cells Tissue nos 369-371
                     and 380-383
                     TAG_SEQ=AAGTGCTTAC"
BASE COUNT      152 a    164 c    169 g    188 t
ORIGIN



  Query Match             63.9%;  Score 330.6;  DB 12;  Length 673;
  Best Local Similarity   86.9%;  Pred. No. 3.3e-77;
  Matches  453;  Conservative    0;  Mismatches   14;  Indels   54;  Gaps    6;
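Each result carries the same three-line statistics block, so it can be read mechanically. The following is a minimal sketch, not part of the search software, that treats the block as a series of "name value;" pairs; the regular expression and the function name `parse_stats` are assumptions made for illustration.

```python
import re

# Statistics block for RESULT 3 (BM984670), as printed in the report.
STATS = """\
Query Match 63.9%; Score 330.6; DB 12; Length 673;
Best Local Similarity 86.9%; Pred. No. 3.3e-77;
Matches 453; Conservative 0; Mismatches 14; Indels 54; Gaps 6;
"""

# A field is a label (letters, spaces, dots), a number, an optional "%", then ";".
FIELD = re.compile(r"\s*([A-Za-z][A-Za-z. ]*?)\s+([-+0-9.eE]+)%?\s*;")


def parse_stats(text):
    """Return the statistics block as a dict mapping field name to float."""
    return {name: float(value) for name, value in FIELD.findall(text)}


stats = parse_stats(STATS)
for name, value in stats.items():
    print(f"{name}: {value}")
```

Percent signs are dropped, so "Query Match" comes back as 63.9 rather than 0.639; integer counts such as "Matches" are returned as floats for uniformity.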



Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db        492 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 433

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db        432 AGGGG-TTTTATTTCAGCAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 374

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db        373 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 314

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
Db        313 TGCGC[...]CCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 254

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
Db        253 [...] 239

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
Db        238 AAGGAAGTAC[...]TTTGAAGAACGC[...]GTAGAGGGAGTGCAGGAAA[...]AGAACTACAG 183

Qy        361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
Db        182 GATGTAGGAAGACGCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 123

Qy        420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
Db        122 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 63

Qy        478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
Db         62 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 22



RESULT 4 

AW146128/C 

LOCUS       AW146128     623 bp    mRNA    linear   EST 10-OCT-2000
DEFINITION  um37e10.x1 Sugano mouse embryo mewa Mus musculus cDNA clone
            IMAGE:2247498 3' similar to gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AW146128
VERSION     AW146128.1  GI:6167864
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 623)
  AUTHORS   Marra,M., Hillier,L., Kucaba,T., Martin,J., Beck,C., Wylie,T.,
            Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers,Y., Person
            ,B., Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk,R., Ritter
            ,E., Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann,R.,
            Waterston,R. and Wilson,R.
  TITLE     The WashU-NCI Mouse EST Project 1999
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/WashU-NCI Mouse EST Project 1999
            Washington University School of Medicine
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:1006958
            Seq primer: custom primer used
            High quality sequence stop: 499.
FEATURES             Location/Qualifiers
     source          1..623
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:2247498"
                     /dev_stage="embryo, 14 dpc"
                     /lab_host="DH10B"
                     /clone_lib="Sugano mouse embryo mewa"
                     /note="Vector: pME18S-FL3; Site_1: DraIII (CACTGTGTG);
                     Site_2: DraIII (CACCATGTG); 1st strand cDNA was primed
                     with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT];
                     double-stranded cDNA was ligated to a DraIII adaptor
                     [TGTTGGCCTACTGG], digested and cloned into distinct DraIII
                     sites of the pME18S-FL3 vector (5' site CACTGTGTG, 3' site
                     CACCATGTG). XhoI should be used to isolate the cDNA
                     insert. Size selection was performed to exclude fragments
                     <1.5kb. Library constructed by Dr. Sumio Sugano
                     (University of Tokyo Institute of Medical Science).
                     Custom primers for sequencing: 5' end primer
                     CTTCTGCTCTAAAAGCTGCG and 3' end primer
                     CGACCTGCAGCTCGAGCACA."
BASE COUNT      123 a    138 c    170 g    191 t      1 others
ORIGIN



  Query Match             63.8%;  Score 329.8;  DB 9;  Length 623;
  Best Local Similarity   80.6%;  Pred. No. 5.3e-77;
  Matches  425;  Conservative    0;  Mismatches   92;  Indels   10;  Gaps    3;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db        541 GGACCAGAGACCCTTTTCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 482

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db        481 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGC[...] 422

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db        421 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAAATGTAC 362

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
Db        361 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 302

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
Db        301 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGA[...] 242

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
Db        241 AGAAGGAAAGGAAGTACATTTGAAGAACCC[...]GTAGAGGAAGTGCAGGAAA[...]AGACCTA 182

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
Db        181 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 122

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 470
Db        121 CTGCTTGAGC[...]CCTGCAAAACATCGAAACCCCTACCAAATAACA[...] 62

Qy        471 ACATTTCAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
Db         61 ACATTACAAAGATGGGCATTTCCCCAATGAAATATACAAGTAAACAT 15



RESULT 5 
AI248089/C 

LOCUS       AI248089     575 bp    mRNA    linear   EST 01-DEC-1998
DEFINITION  qh69f05.x1 Soares_fetal_liver_spleen_1NFLS_S1 Homo sapiens cDNA
            clone IMAGE:1849953 3' similar to gb:X57025_rna1 INSULIN-LIKE
            GROWTH FACTOR IA PRECURSOR (HUMAN);, mRNA sequence.
ACCESSION   AI248089
VERSION     AI248089.1  GI:3843486
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 575)
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
            Tumor Gene Index
  JOURNAL   Unpublished
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            This clone is available royalty-free through LLNL; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            Insert Length: 918  Std Error: 0.00
            Seq primer: -40UP from Gibco
            High quality sequence stop: 380.
FEATURES             Location/Qualifiers
     source          1..575
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:1849953"
                     /sex="male"
                     /dev_stage="20 week-post conception fetus"
                     /lab_host="DH10B (ampicillin resistant)"
                     /clone_lib="Soares_fetal_liver_spleen_1NFLS_S1"
                     /note="Organ: Liver and Spleen; Vector: pT7T3D (Pharmacia)
                     with a modified polylinker; Site_1: Pac I; Site_2: Eco RI;
                     This is a subtracted version of the original Soares fetal
                     liver spleen 1NFLS library. 1st strand cDNA was primed
                     with a Pac I - oligo(dT) primer [5'
                     AACTGGAAGAATTAATTAAAGATCTTTTTTTTTTTTTTTTTTT 3'],
                     double-stranded cDNA was ligated to Eco RI adaptors
                     (Pharmacia), digested with Pac I and cloned into the Pac I
                     and Eco RI sites of the modified pT7T3 vector. Library
                     went through one round of normalization. Library
                     constructed by Bento Soares and M.Fatima Bonaldo."
BASE COUNT      135 a    152 c    131 g    156 t      1 others
ORIGIN



Query Match 63.2%; Score 326.6; DB 9; Length 575; 

Best Local Similarity 86.6%; Pred. No. 3.7e-76; 

Matches 438; Conservative 0; Mismatches 15; Indels 53; Gaps 5; 



Qy         16 TGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCTTTTATTTC 75
Db        551 TGCGGGGCTGAGCTGGTGNATGCTCTTCAGTTCGTGTGTGAAGACAGGGGCTTTTATTTC 492

Qy         76 AACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAGACAGGCATCGTGGAT 135
Db        491 AACAAGCCC[...]CAGGGTATGGCTCC[...]GC[...]GTCGGAGGGCGCCT[...]GACAGGCATCGTGGAT 432

Qy        136 GAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTATTGCGCACCCCTCAAG 195
Db        431 GAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTATTGCGCACCCCTCAAG 372

Qy        196 CCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGACATGCCCAAGACCCAG 255
Db        371 CCTGCC[...]GTCAGCTCGCTCTGTCCGTGCCCAGCGCC[...]CACCGACATGCCCAAGACCCAG 312

Qy        256 AAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGAAGGAAAGGAAGTACA 315
Db        311 [...] 301

Qy        316 TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTA-GAAGACCC 374
Db        300 TTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTAGGAAGACCC 241

Qy        375 TTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCTGCAC-AGTTACCTG 433
Db        240 TCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTCTGCACGAGTTACCTG 181

Qy        434 [...] 491
Db        180 TTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTAAAAGATGGGCGTTTC 121

Qy        492 CCCCAATGAAATACACAAGTAAACAT 517
Db        120 CCCCAATGAAATACACAAGTAAACAT 95





RESULT 6 

AI169253/C 

LOCUS       AI169253     549 bp    mRNA    linear   EST 08-JAN-1999
DEFINITION  EST215088 Normalized rat kidney, Bento Soares Rattus sp. cDNA clone
            RKIBP33 3' end, mRNA sequence.
ACCESSION   AI169253
VERSION     AI169253.1  GI:4134375
KEYWORDS    EST.
SOURCE      Rattus sp.
  ORGANISM  Rattus sp.
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1  (bases 1 to 549)
  AUTHORS   Lee,N.H., Glodek,A., Chandra,I., Mason,T.M., Quackenbush,J.,
            Kerlavage,A.R. and Adams,M.D.
  TITLE     Rat Genome Project: Generation of a Rat EST (REST) Catalog & Rat
            Gene Index
  JOURNAL   Unpublished
COMMENT     On Oct 6, 1998 this sequence version replaced gi:3705561.
            Other_ESTs: TC50779
            Contact: Lee, NH
            The Institute for Genomic Research
            9712, Medical Center Drive, Rockville, MD 20850, USA
            Tel: (301)-838-3529
            Fax: (301)-838-0208
            Email: nhlee@tigr.org
            Seq primer: M13-21.
FEATURES             Location/Qualifiers
     source          1..549
                     /organism="Rattus sp."
                     /mol_type="mRNA"
                     /db_xref="taxon:10118"
                     /clone="RKIBP33"
                     /clone_lib="Normalized rat kidney, Bento Soares"
                     /note="Organ: kidney; Vector: pT7T3Pac; Site_1: EcoRI;
                     Site_2: NotI"
BASE COUNT      112 a    140 c    133 g    164 t
ORIGIN



  Query Match             61.2%;  Score 316.6;  DB 9;  Length 549;
  Best Local Similarity   80.8%;  Pred. No. 1.7e-73;
  Matches  421;  Conservative    0;  Mismatches   89;  Indels   11;  Gaps    4;

Qy          8 AGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCT 67
Db        549 AGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAATTCGTGTGTGGACCAAGGGGCT 490

Qy         68 TTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAGACAGGCA 127
Db        489 TTTACTTCAACAAGCCCACAGGCTATGGCTCCAG[...] 430

Qy        128 TCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTATTGCGCAC 187
Db        429 TTGTGGATGAGTGTAGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGCTC 370

Qy        188 CCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGACATGCCCA 247
Db        369 CGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACATTGACATGCCCA 310

Qy        248 AGACCCAGAAGTATCAGCCCCCATCTACCAACAAG[...] 304
Db        309 AGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGGAGAAGGA 250

Qy        305 AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATG 364
Db        249 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACTTACAGAATG 190

Qy        365 TA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCTGCA 423
Db        189 TAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGCTGCTTG 130

Qy        424 CAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATCACATTTC 477
Db        129 AGCAACCTGCAAAACATCGGAACACCTGCCAAATA[...] 70

Qy        478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
Db         69 AGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACAT 29



RESULT 7 
AI265629/c 

LOCUS       AI265629     558 bp    mRNA    linear   EST 18-NOV-1998
DEFINITION  uj04b07.x1 Sugano mouse liver mlia Mus musculus cDNA clone
            IMAGE:1890901 3' similar to gb:X04482 Mouse mRNA for
            preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION   AI265629
VERSION     AI265629.1  GI:3883787
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 558)
  AUTHORS   Marra,M., Hillier,L., Allen,M., Bowles,M., Dietrich,N., Dubuque,T.,
            Geisel,S., Kucaba,T., Lacy,M., Le,M., Martin,J., Morris,M.,
            Schellenberg,K., Steptoe,M., Tan,F., Underwood,K., Moore,B.,
            Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson,R. and
            Waterston,R.
  TITLE     The WashU-HHMI Mouse EST Project
  JOURNAL   Unpublished
COMMENT     Contact: Marra M/Mouse EST Project
            WashU-HHMI Mouse EST Project
            Washington University School of MedicineP
            4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108
            Tel: 314 286 1800
            Fax: 314 286 1810
            Email: mouseest@watson.wustl.edu
            This clone is available royalty-free through LLNL; contact the
            IMAGE Consortium (info@image.llnl.gov) for further information.
            MGI:975225
            Seq primer: custom primer used
            High quality sequence stop: 495.
FEATURES             Location/Qualifiers
     source          1..558
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:1890901"
                     /sex="female"
                     /dev_stage="adult"
                     /lab_host="DH10B"
                     /clone_lib="Sugano mouse liver mlia"
                     /note="Organ: liver; Vector: pME18S-FL3; Site_1: DraIII
                     (CACTGTGTG); Site_2: DraIII (CACCATGTG); 1st strand cDNA
                     was primed with an oligo(dT) primer
                     [ATGTGGCCTTTTTTTTTTTTTTTTT]; double-stranded cDNA was
                     ligated to a DraIII adaptor [TGTTGGCCTACTGG], digested
                     and cloned into distinct DraIII sites of the pME18S-FL3
                     vector (5' site CACTGTGTG, 3' site CACCATGTG). XhoI should
                     be used to isolate the cDNA insert. Size selection was
                     performed to exclude fragments <1.5kb. Library
                     constructed by Dr. Sumio Sugano (University of Tokyo
                     Institute of Medical Science). Custom primers for
                     sequencing: 5' end primer CTTCTGCTCTAAAAGCTGCG and 3' end
                     primer CGACCTGCAGCTCGAGCACA."
BASE COUNT      106 a    135 c    156 g    161 t
ORIGIN



  Query Match             61.1%;  Score 315.8;  DB 9;  Length 558;
  Best Local Similarity   80.8%;  Pred. No. 2.7e-73;
  Matches  408;  Conservative    0;  Mismatches   87;  Indels   10;  Gaps    3;



Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db        506 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 447

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db        446 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 387

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db        386 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 327

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
Db        326 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 267

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
Db        266 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 207

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
Db        206 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 147

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
Db        146 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 87

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 470
Db         86 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAA[...] 27

Qy        471 ACATTTCAAAGATGGCATTTCCCCC 495
Db         26 ACATTACAAAGATGGGCATTTCCCC 2



RESULT 8 

AA542914/C 

LOCUS AA542914 498 bp mRNA linear EST 19-AUG-199 r
DEFINITION ni98c10.s1 NCI_CGAP_Pr21 Homo sapiens cDNA clone IMAGE:984882 3'
similar to gb:X57025_rna1 INSULIN-LIKE GROWTH FACTOR IA PRECURSOR
(HUMAN);, mRNA sequence.
ACCESSION AA542914
VERSION AA542914.1 GI:2291394
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 498)
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
Tumor Gene Index
JOURNAL Unpublished
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Michael J. Brownstein, M.D., Ph.D., Michael R.
Emmert-Buck, M.D., Ph.D.
cDNA Library Preparation: M. Bento Soares, Ph.D.
cDNA Library Arrayed by: Greg Lennon, Ph.D.
DNA Sequencing by: Washington University Genome Sequencing Center
Clone distribution: NCI-CGAP clone distribution information can be
found through the I.M.A.G.E. Consortium/LLNL at:
www-bio.llnl.gov/bbrp/image/image.html
Insert Length: 603 Std Error: 0.00
Seq primer: -40m13 fwd. ET from Amersham
High quality sequence stop: 412.
FEATURES Location/Qualifiers
source 1..498
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:984882"
/sex="male"
/tissue_type="normal prostate"
/lab_host="DH10B"
/clone_lib="NCI_CGAP_Pr21"
/note="Organ: prostate; Vector: pT7T3D-Pac (Pharmacia)
with a modified polylinker; 1st strand cDNA was prepared
from normal prostate bulk tissue, and was then primed with
a Not I - oligo(dT) primer. Double-stranded cDNA was
ligated to Eco RI adaptors (Pharmacia), digested with Not
I and cloned into the Not I and Eco RI sites of the
modified pT7T3 vector. Library is not normalized. Library
was constructed by Bento Soares and M. Fatima Bonaldo."
BASE COUNT 105 a 135 c 123 g 135 t
ORIGIN

Query Match 60.9%; Score 314.8; DB 9; Length 498;

Best Local Similarity 86.2%; Pred. No. 4.9e-73;

Matches 450; Conservative 0; Mismatches 17; Indels 55; Gaps



Qy 1 GGACCGGAGACGCTCTGCGGGGC-TGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGA 59
M I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I I
Db 476 GGACCGGAGAACTTTTGCGGGGCTTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGA 417

Qy 60 CAGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCA 119
Mill II MIMIMIMMMIMMIMMMIMMIIMMIMMMMIMM
Db 416 CAGGGGC-TTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCA 358

Qy 120 GACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTA 179
r 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j f j i r 1 1 f 1 1 1 1 j 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 j 1 1 f 1 1 1 1
Db 357 GACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTA 298

Qy 180 TTGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGA 239
IIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMI
Db 297 TTGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGA 238

Qy 240 CATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAG 299
III I Mill I llllll
Db 237 CATGCCCAAGACCCAG 222

Qy 300 AAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACA 359
1 IMIMIIIIIIIIIIIIIIIIMIIIIIMII
Db 221 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACA 167

Qy 360 GGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCT 418
MINI I IIIIIIIIMIIIIMIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 166 GGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCT 107

Qy 419 CTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
MMII lllllllll Mill MINI III IIIMIIIIIIIIIIIII 1 1 1 1 M
Db 106 CTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTT 47

Qy 477 CAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
MMII Ml MINIUM II llllllllll MINIM
Db 46 AAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 5

RESULT 9 
CD373004 

LOCUS CD373004 614 bp mRNA linear EST 29-MAY-2003

DEFINITION UI-R-GRO-csv-j-17-0-UI.r1 UI-R-GRO Rattus norvegicus cDNA clone

UI-R-GRO-csv-j-17-0-UI 5', mRNA sequence.
ACCESSION CD373004 

VERSION CD373004.1 GI:31157094 

KEYWORDS EST. 

SOURCE Rattus norvegicus (Norway rat) 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus. 

REFERENCE 1 (bases 1 to 614) 

AUTHORS Bonaldo,M.F. , Lennon,G. and Soares ,M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 

discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8889548 
COMMENT Contact: Soares, MB 



Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA

Tel: 319 335 8250 

Fax: 319 335 9565 

Email: bento-soares@uiowa.edu 

Tissue Procurement: James Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Distribution information can be found at 
http://genome.uiowa.edu/distribution/rat.html
Seq primer: M13 REVERSE.
FEATURES Location/Qualifiers 
source 1..614

/organism="Rattus norvegicus"
/mol_type="mRNA"
/strain="Sprague-Dawley"
/db_xref="taxon:10116"
/clone="UI-R-GRO-csv-j-17-0-UI"
/tissue_type="Whole embryo"
/dev_stage="embryo 13dpc"
/lab_host="DH10B (Life Technologies) (T1 phage resistant)"
/clone_lib="UI-R-GR0"
/note="Vector: pYX-Asc; Site_1: EcoR I; Site_2: Not I;
UI-R-GRO is a cDNA library containing the following 
tissue (s): rat whole embryo 13dpc. The library was 
constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6:791-806, 1996. Denatured RNA was size 
fractionated on a 1% agarose gel. First strand cDNA 
synthesis was primed with oligo-dT primer containing a Not 
I site. Double strand cDNA was size selected according to 
mRNA size fraction, ligated with EcoR I adaptor, digested 
with NotI and then cloned directionally into pYX-Asc 
vector. The library tag sequence located between the Not I 
site and the polyA tail is CATCTCTACT. This library was 
created for the University of Iowa Program for Rat Gene 
Discovery and Mapping (Val Sheffield, Bento Soares and Tom 
Casavant) . " 

BASE COUNT 171 a 168 c 154 g 119 t 2 others 

ORIGIN 

Query Match 60.0%; Score 310; DB 14; Length 614; 

Best Local Similarity 80.3%; Pred. No. 9.9e-72; 

Matches 388; Conservative 0; Mismatches 91; Indels 4; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

HIM Mill II IIIIIIMIIIIIIIIIIII i 

Db 116 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 175 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

II I MM I II I I MINIM MM 1 1 II IIIIIMIMIII MIMIIII II III 

Db 176 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCT 235 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

M Mill MMIMIMI MMMMIMMMIMM 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

Db 236 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 295

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

M II 1 1 M I II I II I II I M M 1 1 1 1 1 1 II 1 1 II II I M 1 1 1| || I 1 1 1 

Db 296 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 355

Qy 241 ATGCCCAAGACCCAGAAGTATCAGrc^ 2 97 

I II M II II 1 1 1 1 II 1 1 1 MIIMI III II II Ml II I MM II I 

Db 356 ATGCCCAAGACTC^GAAGTCCCAGCCCCTATCGAC^CAC^ 415 

Qy 298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357

M 1 1 1 1 1 1 M 1 1 1 1 1 1 ! I 1 1 1 1 1 1 M M ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 III 

Db 416 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 475 

Qy 358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416

III INN II II III I 1 1 1 1 1 I II II I MM 1 1 1 1 1 1 

Db 476 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATTCCACGTCACCGCATGATCCTTTG 535 



Qy 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476

II I Mill 1 1 1 1 III MINI I II III II I 

Db 536 CTGCTTGAGCAACCTGCANAACATCGGAACA 595 

Qy 477 CAA 479 

I I 

Db 596 CCA 598 



RESULT 10 
AI119218 

LOCUS AI119218 816 bp mRNA linear EST 02-SEP-1998 

DEFINITION ue94h02.yl Sugano mouse embryo mewa Mus musculus cDNA clone 
IMAGE: 1498803 5' similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE);, mRNA sequence. 

ACCESSION AI119218 

VERSION AI119218.1 GI:3519542 

KEYWORDS EST . 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 816) 

AUTHORS Marra,M., Hillier,L., Allen,M., Bowles,M., Dietrich,N., Dubuque,T.,
Geisel,S., Kucaba,T., Lacy,M., Le,M., Martin,J., Morris,M.,
Schellenberg,K., Steptoe,M., Tan,F., Underwood,K., Moore,B.,
Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson,R. and
Waterston,R.
TITLE The WashU-HHMI Mouse EST Project

JOURNAL Unpublished
COMMENT Contact: Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Project 

Washington University School of MedicineP 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI:936407

Seq primer: custom primer used
High quality sequence stop: 473.
FEATURES Location/Qualifiers

source 1..816

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL" 

/db_xref="taxon:10090"

/clone="IMAGE:1498803"

/dev_stage="embryo, 14 dpc"

/lab_host="DH10B"

/clone_lib="Sugano mouse embryo mewa"

/note="Vector: pME18S-FL3; Site_1: DraIII (CACTGTGTG);
Site_2: DraIII (CACCATGTG); 1st strand cDNA was primed
with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT];
double-stranded cDNA was ligated to a DraIII adaptor
[TGTTGGCCTACTGG], digested and cloned into distinct DraIII
sites of the pME18S-FL3 vector (5' site CACTGTGTG, 3' site
CACCATGTG). XhoI should be used to isolate the cDNA
insert. Size selection was performed to exclude fragments
<1.5kb. Library constructed by Dr. Sumio Sugano
(University of Tokyo Institute of Medical Science).
Custom primers for sequencing: 5' end primer
CTTCTGCTCTAAAAGCTGCG and 3' end primer
CGACCTGCAGCTCGAGCACA."

BASE COUNT 230 a 219 c 172 g 187 t 8 others 

ORIGIN 



Query Match 59.8%; Score 309; DB 9; Length 816; 

Best Local Similarity 80.2%; Pred. No. 2e-71; 

Matches 384; Conservative 0; Mismatches 91; Indels 4; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II IIIMIIIIIIIIIIIIIMIIIIIIMIIIIMIIIIIIII 

Db 323 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 382 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

MM II Ml II Mill MUM MM II I III I III Mi II Mill MM II I III 

Db 383 AGGGGCTTTTACTT(^^CAAGCCCACA.GGCTATGGCTCC^GCATTCGGAGGGCACCTCAG 442 



Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

MMMINIMMIMM I II I II I II I II I II 1 1 II I Mill IMIMIMM

Db 443 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 502

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

II M II II II 1 1 1 1 I II MM MUM MMMMMMMIMM III 

Db 503 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCC(^GCGCa\CACTGAC 562 



Qy 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAAC^ G 297
IMIIIIIIII lllllll I I Ml III II IMIIMI MUM II I
Db 563 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACA^ 622

Qy 298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
MUM II I Ml III III Ml lllllll I II III Ml 1 1 llllll II II III II Ml
Db 623 AGAAGGAAAGGAAGTA(^TTTGAAGAACAC>AGTAGAGGAAGTGCANGAAACAAGACCTA 682

Qy 358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
Ml Mill II II III I MM I I I II lllllll II IIMM
Db 683 CAGAATGTANGAGGAGCCTNCCACGGAG(^GAANATGCCACATCACCGCANGATCCTTTG 742

Qy 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATT 475
II I Mill I M Ml Mill llllll II I
Db 743 CTGCTTGAGCAACCTGCANAAC7VTCGAAACACCTACCAAAT 801



RESULT 11 

BF383724 

LOCUS BF383724 594 bp mRNA linear EST 27-NOV-2000
DEFINITION 602044632F1 NCI_CGAP_Li9 Mus musculus cDNA clone IMAGE:4194295 5',
mRNA sequence.
ACCESSION BF383724
VERSION BF383724.1 GI:11365029
KEYWORDS EST.
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 594)
AUTHORS NIH-MGC http://mgc.nci.nih.gov/.
TITLE National Institutes of Health, Mammalian Gene Collection (MGC)
JOURNAL Unpublished
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Jeffrey E. Green, M.D.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Incyte Genomics, Inc.
Clone distribution: MGC clone distribution information can be
found through the I.M.A.G.E. Consortium/LLNL at:
http://image.llnl.gov
Plate: LLAM9527 row: p column: 08
High quality sequence stop: 589.
FEATURES Location/Qualifiers
source 1..594
/organism="Mus musculus"
/mol_type="mRNA"
/strain="FVB/N"
/db_xref="taxon:10090"
/clone="IMAGE:4194295"
/lab_host="DH10B (T1 phage-resistant)"
/clone_lib="NCI_CGAP_Li9"
/note="Organ: liver; Vector: pCMV-SPORT6; Site_1: NotI;
Site_2: SalI; Cloned unidirectionally. Primer: Oligo dT.
Average insert size 1.9 kb. Constructed by Life
Technologies. Note: this is a NCI_CGAP Library."
BASE COUNT 175 a 162 c 142 g 115 t
ORIGIN



Query Match 58.7%; Score 303.6; DB 10; Length 594; 

Best Local Similarity 80.7%; Pred. No. 4.9e-70; 

Matches 394; Conservative 0; Mismatches 84; Indels 10; Gaps 



Qy 16 TGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCTTTTATTTC 75
MMMIIMMMMMMIMMMMMMMMIMM MIMIIMM Ml
Db 107 TGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCGAGGGGCTTTTACTTC 166

Qy 76 AACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAGACAGGCATCGTGGAT 135
MMIIMIIMM lllllllllllll lllllllll IIIIMIIIIIIII 1 1 1 1 1 1
Db 167 AACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTC^ 226

Qy 136 GAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTATTGCGCACCCCTCAAG 195
Mill MMMMMMIMIMM Mill MIIIIIIMI II II II II Ml
Db 227 GAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTACTGTGCCCCACTGAAG 286

Qy 196 CCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGACATGCCCAAGACCCAG 255
Ml I II MM llllll Ml Ml Ml III I Mill I III MM Mill II III
Db 287 CCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGACATGCCCAAGACTCAG 346

Qy 256 AAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---GAGAAGGAAAGGAAGT 312
MM I I Ml Ml II I Mill II llllll II III III III MM I II
Db 347 AAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGGAGAAGGAAAGGAAGT 406

Qy 313 ACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTA-GAAGA 371
MM III II II Ml MINIUM Ml II Illllll II 1 1 Mill I Mill II II
Db 407 ACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTACAGAATGTAGGAGGA 466

Qy 372 CCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCTGCACAGTTACC 431
Ml I MM I M I II II llllll II I MM: I
Db 467 GCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTGCTGCTTGAGCAACCT 526

Qy 432 TGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATCACATTTCAAAGATGG 485
MUM I II III III II IMIMI M Mill lllllllll
Db 527 GCAAAACATCGAAACACCTACC7\AATAACAATAATA 586

Qy 486 CATTTCCC 493
II II
Db 587 GCATTGCC 594



RESULT 12 

AA913900/C 

LOCUS AA913900 527 bp mRNA linear EST 24-SEP-199E
DEFINITION ol35g05.s2 Soares_NFL_T_GBC_S1 Homo sapiens cDNA clone
IMAGE:1525496 3' similar to gb:X57025_rna1 INSULIN-LIKE GROWTH
FACTOR IA PRECURSOR (HUMAN);, mRNA sequence.
ACCESSION AA913900
VERSION AA913900.1 GI:3053292
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 527)
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
Tumor Gene Index
JOURNAL Unpublished
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
This clone is available royalty-free through LLNL ; contact the
IMAGE Consortium (info@image.llnl.gov) for further information.
Insert Length: 870 Std Error: 0.00
Seq primer: -40m13 fwd. ET from Amersham
High quality sequence stop: 97.
FEATURES Location/Qualifiers
source 1..527
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:1525496"
/lab_host="DH10B"
/clone_lib="Soares_NFL_T_GBC_S1"
/note="Organ: pooled; Vector: pT7T3D-Pac (Pharmacia) with
a modified polylinker; Site_1: Not I; Site_2: Eco RI;
Equal amounts of plasmid DNA from three normalized
libraries (fetal lung NbHL19W, testis NHT, and B-cell
NCI_CGAP_GCB1) were mixed, and ss circles were made in
vitro. Following HAP purification, this DNA was used as
tracer in a subtractive hybridization reaction. The driver
was PCR-amplified cDNAs from pools of 5,000 clones made
from the same 3 libraries. The pools consisted of
I.M.A.G.E. clones 297480-302087, 682632-687239,
726408-728711, and 729096-731399. Subtraction by Bento
Soares and M. Fatima Bonaldo."
BASE COUNT 125 a 134 c 119 g 149 t
ORIGIN



Query Match 58.0%; Score 299.8; DB 9; Length 527;

Best Local Similarity 85.5%; Pred. No. 4.9e-69;

Matches 413; Conservative 0; Mismatches 17; Indels 53; Gaps 5;

Qy 39 TCTTCAGTTCGTGTGTGGAGACAGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTC 98
I ! 1 1 1 ! I II 1 1 1 II I II I II 1 1 1 Ml 1 1 1 II I Ml I II III III MINIMI II .
Db 527 TCTTCAGTTCGTGTGTGGAGACAGGGGCTTTATTTACAACAAGCCCACAGGGTATGGCTC 468

Qy 99 CAGCAGTCGGAGGGCGCCTCAGACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGA 158
M M M I II I M I II I II I 1 t I I I I I I I i I I f f I I f t I £ 1 f I t f I I I I I 1 I I I I E J I I I
Db 467 CAGCAGTCGGAGGGCGCCTAAGACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGA 408

Qy 159 TCTAAGGAGGCTGGAGATGTATTGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGT 218
M I I I I I I ;M I I I I I Ml :| I: I
Db 407 TCTAAGGAGGCTGGAGATGTATTGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGT 348

Qy 219 CCGTGCCCAGCGCCACACCGACATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAA 278
Ml MM Ml Ml Mill I MUM III II I MM II.
Db 347 CCGTGCCCAGCGCCACACCGACATGCCCAAGACCCAG 311

Qy 279 CAAGAACACGAAGTCTCAGAGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGA 338
1 1 1 1 1 1 1 1 1 ! I f 1 1 11 1 1 1 1 1 MMIMIMM
Db 310 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGA 277

Qy 339 GTGCAGGAAACAAGAACTACAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAG 397
MM II I III III IMM I III Ml III I Mill II I I III III II I MM MM
Db 276 GTGCAGGAAACAAGAACTACAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACAT 217

Qy 398 GCCACCGCAGGACCCTTTGCTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAA 455
MMIMIMM I II II MM lllllllll Mill llllll Ml MM
Db 216 GCCACCGCAGGATCCTTTGCTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACOIA 157

Qy 456 AAAATAAGTTTGATCACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAA 514
INI III III Ml I llllll llllll III II Mill III III III lllllllll II
Db 156 AAAATAAGTTTGATAACATTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAA 97

Qy 515 CAT 517
Ml
Db 96 CAT 94



RESULT 13 
AI876493/C 

LOCUS AI876493 642 bp mRNA linear EST 21-JUL-1999
DEFINITION uj59b10.x1 Sugano mouse liver mlia Mus musculus cDNA clone
IMAGE:1924219 3' similar to gb:X57025_rna1 INSULIN-LIKE GROWTH
FACTOR IA PRECURSOR (HUMAN); gb:X04482 Mouse mRNA for
preproinsulin-like growth factor IB (MOUSE);, mRNA sequence.
ACCESSION AI876493
VERSION AI876493.1 GI:5550542
KEYWORDS EST.
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 642)
AUTHORS Marra,M., Hillier,L., Kucaba,T., Martin,J., Beck,C., Wylie,T.,
Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers,Y., Person,B.,
Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk,R., Ritter,E.,
Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann,R.,
Waterston,R. and Wilson,R.
TITLE The WashU-NCI Mouse EST Project 1999
JOURNAL Unpublished
COMMENT Contact: Marra M/WashU-NCI Mouse EST Project 1999
Washington University School of Medicine
4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
Tel: 314 286 1800
Fax: 314 286 1810
Email: mouseest@watson.wustl.edu
This clone is available royalty-free through LLNL ; contact the
IMAGE Consortium (info@image.llnl.gov) for further information.
MGI:980511
Seq primer: custom primer used
High quality sequence stop: 257.
FEATURES Location/Qualifiers
source 1..642
/organism="Mus musculus"
/mol_type="mRNA"
/strain="C57BL"
/db_xref="taxon:10090"
/clone="IMAGE:1924219"
/sex="female"
/dev_stage="adult"
/lab_host="DH10B"
/clone_lib="Sugano mouse liver mlia"
/note="Organ: liver; Vector: pME18S-FL3; Site_1: DraIII
(CACTGTGTG); Site_2: DraIII (CACCATGTG); 1st strand cDNA
was primed with an oligo(dT) primer
[ATGTGGCCTTTTTTTTTTTTTTTTT]; double-stranded cDNA was
ligated to a DraIII adaptor [TGTTGGCCTACTGG], digested
and cloned into distinct DraIII sites of the pME18S-FL3
vector (5' site CACTGTGTG, 3' site CACCATGTG). XhoI should
be used to isolate the cDNA insert. Size selection was
performed to exclude fragments <1.5kb. Library
constructed by Dr. Sumio Sugano (University of Tokyo
Institute of Medical Science). Custom primers for
sequencing: 5' end primer CTTCTGCTCTAAAAGCTGCG and 3' end
primer CGACCTGCAGCTCGAGCACA."
BASE COUNT 127 a 154 c 175 g 185 t 1 others
ORIGIN



Query Match 56.0%; Score 289.6; DB 9; Length 642;

Best Local Similarity 78.9%; Pred. No. 2.7e-66;

Matches 397; Conservative 0; Mismatches 95; Indels 11; Gaps 4;



Qy 2 GACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACA 61 

INI Mill II II MMIM II Ml II MINI Mill III MM Mill II I 

Db 503 GACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGGTCGTGTGTGGACCGA 444 

Qy 62 GGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAGA 121

II 1 1 1 1 1 1 1 I M M 1 1 II II M 1 1 1 M I II I M II II I I 1 1 M 1 1 1 I II 1 1 II

Db 443 GGGGCTTTTTCTTCAACAAGGCCACAGGCTATGGCTCCAGCATTTGGAGGGCACCTCAGA 384

Qy 122 CAGGCATCGTGGATGAGTGCTGCTTCCGG-AGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Ml II 1 1 1 1 1 ! 1 1 1 1 1 1 1 f 1 1 1 1 1 1 i i II II Ml lllllll

Db 383 CAGTCAATGTGGATGAGTGTTGCTTCCGGAAGCTGTGATCTGAGAAGACTGNAGATGTAC 324

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

M M II II Mill I I II MM MIMI MM III II Mill II I II III

Db 323 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCAGACTGAC 264

Qy 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297

II MM Ml II MMIM 1 I III III II II III Ml I Mill II I

Db 263 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 204

Qy 298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357

I M M I I II II I I II I I I I I I II I I I || || || || I | || | I II || M II I I II I II III

Db 203 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 144

Qy 358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416

Ml Mill II II III I MM Ml I II IMMIIMI MIMI

Db 143 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 84

Qy 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 470

M I MIMI I II III III II MMIM II

Db 83 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 24

Qy 471 ACATTTCAAAGATGGCATTTCCC 493

Mill MINIMI II II

Db 23 ACATTACAAAGATGGGCATTTCC 1



RESULT 14 

AW495481/C 

LOCUS AW495481 499 bp mRNA linear EST 24-FEB-2000
DEFINITION UI-M-BH3-auy-g-11-0-UI.s1 NIH_BMAP_M_S4 Mus musculus cDNA clone
UI-M-BH3-auy-g-11-0-UI 3', mRNA sequence.
ACCESSION AW495481
VERSION AW495481.1 GI:7065762
KEYWORDS EST.
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 499)
AUTHORS Bonaldo,M.F., Lennon,G. and Soares,M.B.
TITLE Normalization and subtraction: two approaches to facilitate gene
discovery
JOURNAL Genome Res. 6 (9), 791-806 (1996)
MEDLINE 97044477
PUBMED 8889548
COMMENT Contact: Chin, H
National Institute of Mental Health
6001 Executive Blvd. Room 7N-7190, MSC 9643, Bethesda, MD
20892-9643, USA
Tel: 301 443 1706
Fax: 301 443 9890
Email: mEST@mail.nih.gov
The sequence contained an oligo-dT track that was present in the
oligonucleotide that was used to prime the synthesis of first
strand cDNA and therefore this may represent a bonafide poly A
tail. The sequence tag present in the cDNA between the NotI site
and the oligo-dT track served to identify it as a clone from the
normalized pineal glands library cDNA Library Preparation: M.B.
Soares Lab Clone distribution: Researchers may obtain BMAP cDNA
clones from RESEARCH GENETICS. It should be noted that Bento Soares
is generating a small number of additional specialized
non-redundant arrays of BMAP cDNAs whose availability will be
considered under appropriate and limited collaborative arrangements
Seq primer: M13 Forward
POLYA=Yes.
FEATURES Location/Qualifiers
source 1..499
/organism="Mus musculus"
/mol_type="mRNA"
/strain="C57BL/6J"
/db_xref="taxon:10090"
/clone="UI-M-BH3-auy-g-11-0-UI"
/dev_stage="27-32 days"
/lab_host="DH10B (Life Technologies)"
/clone_lib="NIH_BMAP_M_S4"
/note="Vector: pT7T3D-Pac (Pharmacia) with a modified
polylinker; Site_1: Not I; Site_2: Eco RI; The
NIH_BMAP_M_S4 library is a subtracted library of a series,
ultimately derived from a mixture of individually tagged
normalized libraries from ten regions of the mouse brain
(cerebellum, brain stems, olfactory bulbs, hypothalamus,
cortex, amygdala, basal ganglia, pineal gland, striatum,
hippocampus) after a series of subtractions to reduce the
representation of cDNAs from which ESTs had already been
generated. The following serially subtracted libraries
were generated in this process: NIH_BMAP_M_S4,
NIH_BMAP_M_S3.3, NIH_BMAP_M_S3.2, NIH_BMAP_M_S3.1,
NIH_BMAP_M_S2, NIH_BMAP_M_S1. The subtracted library
(NIH_BMAP_M_S4) was constructed as follows: PCR amplified
cDNA inserts from NIH_BMAP_M_S3.3, NIH_BMAP_M_S3.2, and
NIH_BMAP_M_S3.1 clones from which 3' ESTs had been derived
was used as a driver in a hybridization with a pool of
the NIH_BMAP_M_S3.3, NIH_BMAP_M_S3.2, and NIH_BMAP_M_S3.1
libraries in the form of single-stranded circles. The
remaining single-stranded circles (subtracted library)
was purified by hydroxyapatite column chromatography,
converted to double-stranded circles and electroporated
into DH10B bacteria (Life Technologies) to generate the
NIH_BMAP_M_S4 library. This procedure has been previously
described (Bonaldo, Lennon and Soares, Genome Research
6:791-806, 1996)
TAG_LIB=NIH_BMAP_M_S4
TAG_TISSUE=pineal-glands
TAG_SEQ=CAGAC"
BASE COUNT 86 a 112 c 124 g 177 t

ORIGIN 

Query Match 55.6%; Score 287.4; DB 9; Length 499;

Best Local Similarity 80.8%; Pred. No. 9.7e-66;

Matches 387; Conservative 0; Mismatches 81; Indels 11; Gaps 4;
Qy 50 TGTGTGGAGACAGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGA 109
i lllllllllll IIIIIIIIMIIIIIII i Mill
Db 499 TGTGTGGACCGAGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGA 440

Qy 110 GGGCGCCTCAGACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGC 169
1 1 1 1 III;! MINI lllllllllll IMIIMIMMIMMMI MM. I
Db 439 GGGCACCTCAGACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGAC 380

Qy 170 TGGAGATGTATTGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGC 229
I 1 1 I M I I I I I I I II I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | |
Db 379 TGGAGATGTACTGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGC 320

Qy 230 GCCACACCGACATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGA 289
I IN I II 1 1 Ml 1 1 III II 1 1 II III II I I III III II II I II III MM
Db 319 GCCACACTGACATGCC(^GACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGA 260

Qy 290 AGTCTCA---GAGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGA 346
II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 1
Db 259 AGCTGCAAAGGAGAAGGAAAGGAAGTACT^TTTGAAGAAC^ 200

Qy 347 AACAAGAACTACAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGC 405
I II III I I Mill Mill II M III I MM I II I II llllll
Db 199 AACAAGACCTACAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATC 140

Qy 406 AGGACCCTTTGCTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAA 459
MM 1 1 II Ml I I llllll I M III III MM
Db 139 AGGATCCTTTGCTGCTTGAGCAACCTGCTyy^ 80

Qy 460 TAAGTTTGATCACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
Mill II Mill IIIMM MIMMIIIIIIIIIIIIII MIMMIMM
Db 79 TAAGTCCAATAACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAA 21



RESULT 15 

AI169770/C 

LOCUS AI169770 468 bp mRNA linear EST 20-JAN-199S
DEFINITION EST215669 Normalized rat liver, Bento Soares Rattus sp. cDNA clone
RLIAT07 3' end, mRNA sequence.
ACCESSION AI169770
VERSION AI169770.1 GI:3709810
KEYWORDS EST.
SOURCE Rattus sp.
ORGANISM Rattus sp.
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
Rattus.
REFERENCE 1 (bases 1 to 468)
AUTHORS Lee,N.H., Glodek,A., Chandra,I., Mason,T.M., Quackenbush,J.,
Kerlavage,A.R. and Adams,M.D.
TITLE Rat Genome Project: Generation of a Rat EST (REST) Catalog & Rat
Gene Index
JOURNAL Unpublished
COMMENT Other_ESTs: TC50779
Contact: Lee, NH
The Institute for Genomic Research
9712, Medical Center Drive, Rockville, MD 20850, USA
Tel: (301)-838-3529
Fax: (301)-838-0208
Email: nhlee@tigr.org
Seq primer: M13-21.
FEATURES Location/Qualifiers
source 1..468
/organism="Rattus sp."
/mol_type="mRNA"
/db_xref="ATCC(in host):2027570"
/db_xref="taxon:10118"
/clone="RLIAT07"
/clone_lib="Normalized rat liver, Bento Soares"
/note="Organ: liver; Vector: pT7T3Pac; Site_1: EcoRI;
Site_2: NotI"
BASE COUNT 85 a 115 c 119 g 149 t

ORIGIN 

Query Match 53.4%; Score 276; DB 9; Length 468; 

Best Local Similarity 80.5%; Pred. No. le-62; 

Matches 375; Conservative 0; Mismatches 80; Indels 11; Gaps 4 
Qy 63 GGGCTTTTATTTC^CAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAGAC 122 

MINIMI MIIMMMMIMM 1 1 ! 1 1 M 1 1 1 1 1 1 IIIMIMI II Mill 

Db 4 68 GGGCTTTTACTTCAACAAGCCC^CAGGCTATGGCTCCAGCATTCGGAGGGCACCAaVGAC 4 09 

Qy 123 AGG CATCGTGGATGAGTGCTG CTT CCGGAG CTGTGAT CTAAGGAGG CTGGAGATGTATTG 182 

Mill 1 1 Mill MM MM Mill MINI II 1 1 MUM I Ml II I III II 

Db 4 08 GGGCATTGTGGATGAGTGTTGCTCCCGGAGCTGTGATCTGAGGAGGTTGGAGATGTACTG 34 9 

Qy 183 CG(^CCCCTCAAGCCTGC(^GTC^GCTCGCTCTC^ 242 

II M M I Ml II I I II I MM Ml II MM i I .1 Mill 

Db 34 8 TGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCC^ 289 
Qy 243 GCCCAAGACCCAGAAGTATC^GCCCCCATCTACCAACAAGAA GAG 299 

MMMMI I Mill I III I MINI II II MM I I MM II Ml 

Db 28 8 GCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^ 229 

Qy 3 00 AAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGC^GGAAAC^GAACTA(^ 359 

Mill II I III Mill II MM Mill MMMMI MM IIIMIMI II I Mill 

Db 22 8 AAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA^ 169 

Qy 3 60 GGATGTA - GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCT 418 

I Mill M M Ml I Mill I II I II II MUM M I Mill M 

Db 168 GAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGCT 109 
Qy 419 CTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATCAC 472 

I . II II 1 1 II II 1 1 1 II 1 1 MMMMI II I 

Db 108 GCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAAT 4 9 

Qy 4 73 ATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 



Db 4 8 ATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACAT 3 
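
The "/C" suffix on the hit ID and the descending Db coordinates (468 down to 3) suggest that the query was aligned against the reverse complement of the database entry. A minimal sketch of that operation follows; the helper names are hypothetical and are not part of GenCore, and the reading of "/C" as "complement strand" is an assumption based on the coordinates shown above.

```python
# Assumption: "/C" marks a hit on the complementary strand of the database entry.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def displayed_row(forward_seq: str, high: int, low: int) -> str:
    """Bases high..low (1-based, forward-strand numbering) as they would
    appear in a complement-strand alignment row."""
    return reverse_complement(forward_seq[low - 1:high])

if __name__ == "__main__":
    toy = "AACCGGTTAC"                      # made-up 10-base entry
    print(reverse_complement(toy))
    print(displayed_row(toy, 10, 6))        # bases 10 down to 6
```

With this convention a row labelled "Db 468 ... 409" would show forward-strand bases 409 to 468, complemented and read right to left.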



Search completed: December 13, 2003, 07:29:47 
Job time : 1690.26 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         December 13, 2003, 05:41:20 ; Search time 2309.97 Seconds
                                           (without alignments)
                                           9156.102 Million cell updates/sec

Title:          US-09-852-261-1
Perfect score:  517
Sequence:       1 ggaccggagacgctctgcgg..........tgaaatacacaagtaaacat 517

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       2888711 seqs, 20454813386 residues

Total number of hits satisfying chosen parameters:      5777422

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
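
The throughput and hit-count figures in this run header can be cross-checked with a few lines of arithmetic. The sketch below assumes that one "cell update" is one query-residue by database-residue comparison and that both strands of every database sequence were searched (strands=2); neither assumption is stated in the report, but the hit count is exactly twice the number of sequences searched, which is consistent with it.

```python
def cell_updates_per_sec(query_len, db_residues, seconds, strands=2):
    """Dynamic-programming cells filled per second of search time.

    strands=2 is an assumption (forward and complement strands searched).
    """
    return query_len * db_residues * strands / seconds

db_seqs = 2888711
db_residues = 20454813386

rate = cell_updates_per_sec(517, db_residues, 2309.97)
hits = 2 * db_seqs  # one candidate hit per strand per sequence (assumed)

print(f"{rate / 1e6:.1f} million cell updates/sec")
print(f"{hits} candidate hits")
```

The computed rate agrees with the reported 9156.102 million cell updates/sec to within the rounding of the search time.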



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10:  gb_ro:*
                11:  gb_sts:*
                12:  gb_sy:*
                13:  gb_un:*
                14:  gb_vi:*
                15:  em_ba:*
                16:  em_fun:*
                17:  em_hum:*
                18:  em_in:*
                19:  em_mu:*
                20:  em_om:*
                21:  em_or:*
                22:  em_ov:*
                23:  em_pat:*
                24:  em_ph:*
                25:  em_pl:*
                26:  em_ro:*
                27:  em_sts:*
                28:  em_un:*
                29:  em_vi:*
                30:  em_htg_hum:*
                31:  em_htg_inv:*
                32:  em_htg_other:*
                33:  em_htg_mus:*
                34:  em_htg_pln:*
                35:  em_htg_rod:*
                36:  em_htg_mam:*
                37:  em_htg_vrt:*
                38:  em_sy:*
                39:  em_htgo_hum:*
                40:  em_htgo_mus:*
                41:  em_htgo_other:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

                                 SUMMARIES
                  %
Result          Query
   No.   Score  Match  Length  DB  ID          Description

     1     517  100.0     517   6  AX147742    AX147742 Sequence
     2     517  100.0     517   6  AX300779    AX300779 Sequence
     3   467.4   90.4     523   6  AX147746    AX147746 Sequence
     4   467.4   90.4     523   6  AX300783    AX300783 Sequence
     5   377.2   73.0     471   6  AX147754    AX147754 Sequence
     6   377.2   73.0     471   6  AX300791    AX300791 Sequence
     7   355.4   68.7     444   9  HSU40870    U40870 Human alter
     8   344.2   66.6     616   9  HSIGF1A     X56773 H.sapiens m
     9   344.2   66.6    7260   6  AX375028    AX375028 Sequence
    10   344.2   66.6    7260   6  AX411095    AX411095 Sequence
    11   344.2   66.6    7260   9  HSIGFACI    X57025 Human IGF-I
    12   342.6   66.3     666   6  A29119      A29119 H.sapiens I
    13   342.6   66.3     725   9  HSIGFI      X00173 Homo sapien
    14   342.6   66.3     728   9  HUMGFII     M29644 Human insul
    15   342.6   66.3    1076   9  HUMIGFI     M27544 Human insul
    16     341   66.0     620   6  I08370      I08370 Sequence 2
    17   331.6   64.1    1536  10  BC012409    BC012409 Mus muscu
    18     330   63.8     798  10  RNIGFI2     X06108 Rat mRNA (c
    19     330   63.8     958  10  RNIGFI1     X06107 Rat mRNA (c
    20   326.8   63.2     710  10  RATIGFIA    M15480 Rat insulin
    21   325.2   62.9     539   6  AX147744    AX147744 Sequence
    22   325.2   62.9     539   6  AX300781    AX300781 Sequence
    23   318.2   61.5     651  10  MMIGFIBR    X04482 Mouse mRNA
    24   308.6   59.7     730   9  HSIGF1B     X56774 H.sapiens m
    25   308.6   59.7    1094   9  HUMGFIB     M11568 Human insul
    26   308.6   59.7    1136   6  E01349      E01349 cDNA encodi
    27   308.6   59.7    1136   6  I08009      I08009 Sequence 3
    28   294.4   56.9     432   4  AF022961    AF022961 Oryctolag
    29   286.4   55.4    3599   6  BD063790    BD063790 Insulin-l
    30   286.4   55.4    3599   6  BD069041    BD069041 Treatment
    31   286.4   55.4    3600   6  BD063789    BD063789 Insulin-l
    32   286.4   55.4    3600   6  BD069040    BD069040 Treatment
    33   285.4   55.2     612   9  HUMIGFIB    M37484 Human insul
    34   283.4   54.8     888   4  ECU85272    U85272 Equus cabal
    35   281.2   54.4     532   4  SSILGF1M    X17492 Porcine mRN
    36   281.2   54.4     567   4  PIGGFIIA    M31175 Pig insulin
    37     276   53.4    1284   4  BTILGF1A    X15726 Bovine mRNA
    38   271.2   52.5     978   4  GOTIGFI     D11378 Goat mRNA f
    39   271.2   52.5     978   6  E05279      E05279 DNA encodin
    40   267.6   51.8     836  10  CPIGF1      X52951 Guinea pig
    41   264.8   51.2     747   4  SHPIGFIA6   M31735 Sheep insul
    42   264.8   51.2     790   4  SHPIGFIA21  M31734 Sheep insul
    43   264.8   51.2    1015   4  SHPIGFIA46  M31736 Sheep insul
    44   264.8   51.2    1107   4  SHPIGFIA1   M30653 Sheep insul
    45   258.4   50.0     317   6  AX147750    AX147750 Sequence
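
The "% Query Match" column appears to be the hit score expressed as a percentage of the perfect score (517 for this query). The short sketch below checks that reading against a few rows of the summary table; the relationship is inferred from the numbers in this report, not taken from GenCore documentation.

```python
PERFECT_SCORE = 517  # "Perfect score" from the run header

def query_match(score, perfect=PERFECT_SCORE):
    """Score as a percentage of the perfect score, to one decimal place."""
    return round(100.0 * score / perfect, 1)

# (result number, score, reported % query match) copied from the summaries
rows = [(3, 467.4, 90.4), (5, 377.2, 73.0), (7, 355.4, 68.7),
        (8, 344.2, 66.6), (45, 258.4, 50.0)]

for number, score, reported in rows:
    print(number, score, query_match(score), reported)
```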



ALIGNMENTS 



RESULT 1
AX147742
LOCUS       AX147742                 517 bp    DNA     linear   PAT 31-AUG-2001
DEFINITION  Sequence 1 from Patent WO0136483.
ACCESSION   AX147742
VERSION     AX147742.1  GI:14346787
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Goldspink,G.R. and Johnson,I.R.
  TITLE     Use of the insulin-like-growth factor i isoform mgf for the
            treatment of neurological disorders
  JOURNAL   Patent: WO 0136483-A 1 25-MAY-2001;
            University College London (GB)
FEATURES             Location/Qualifiers
     source          1..517
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
     CDS             <1..333
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAC41175.1"
                     /db_xref="GI:14346788"
                     /db_xref="REMTREMBL:CAC41175"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKYQPPSTNKNTKSQRRK
                     GSTFEEHK"
BASE COUNT      150 a    130 c    139 g     98 t
ORIGIN

  Query Match             100.0%;  Score 517;  DB 6;  Length 517;
  Best Local Similarity   100.0%;  Pred. No. 4.4e-155;
  Matches  517;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

Qy        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420

Qy        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480

Qy        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||
Db        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
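
The CDS feature of this record (<1..333, codon_start=1) can be checked against the aligned sequence: translating bases 1 to 333 with the standard genetic code should reproduce the /translation qualifier. The sketch below does this with a hand-built codon table; the variable and function names are illustrative only, and the nucleotide string is bases 1 to 333 of the query as shown in the alignment rows above.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# Bases 1..333 of the query, in 60-base rows as in the alignment.
cds = (
    "GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC"
    "AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG"
    "ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT"
    "TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC"
    "ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA"
    "AGGAAAGGAAGTACATTTGAAGAACACAAGTAG"
)

protein = translate(cds)
print(len(cds), len(protein))
print(protein)
```

The 333 bases give 110 residues followed by a TAG stop codon, which matches the three-line /translation value in the record.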





RESULT 2
AX300779
LOCUS       AX300779                 517 bp    DNA     linear   PAT 30-NOV-2001
DEFINITION  Sequence 1 from Patent WO0185781.
ACCESSION   AX300779
VERSION     AX300779.1  GI:17382060
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Goldspink,G.D. and Terenghi,G.B.
  TITLE     Repair of nerve damage
  JOURNAL   Patent: WO 0185781-A 1 15-NOV-2001;
            University College London (GB); East Grinstead Medical Research
            Trust (GB)
FEATURES             Location/Qualifiers
     source          1..517
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
     CDS             <1..333
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD13040.1"
                     /db_xref="GI:17382061"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKYQPPSTNKNTKSQRRK
                     GSTFEEHK"
BASE COUNT      150 a    130 c    139 g     98 t
ORIGIN

  Query Match             100.0%;  Score 517;  DB 6;  Length 517;
  Best Local Similarity   100.0%;  Pred. No. 4.4e-155;
  Matches  517;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

Qy        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420

Qy        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480

Qy        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||
Db        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517





RESULT 3
AX147746
LOCUS       AX147746                 523 bp    DNA     linear   PAT 31-AUG-2001
DEFINITION  Sequence 5 from Patent WO0136483.
ACCESSION   AX147746
VERSION     AX147746.1  GI:14346791
KEYWORDS
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.R. and Johnson,I.R.
  TITLE     Use of the insulin-like-growth factor i isoform mgf for the
            treatment of neurological disorders
  JOURNAL   Patent: WO 0136483-A 5 25-MAY-2001;
            University College London (GB)
FEATURES             Location/Qualifiers
     source          1..523
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..336
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAC41177.1"
                     /db_xref="GI:14346792"
                     /db_xref="REMTREMBL:CAC41177"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRR
                     KGSTFEEHK"
BASE COUNT      154 a    129 c    142 g     98 t
ORIGIN

  Query Match             90.4%;  Score 467.4;  DB 6;  Length 523;
  Best Local Similarity   96.2%;  Pred. No. 4.4e-139;
  Matches  501;  Conservative    0;  Mismatches   16;  Indels    4;  Gaps    2;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
              ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy        477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||||||
Db        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521



RESULT 4
AX300783
LOCUS       AX300783                 523 bp    DNA     linear   PAT 30-NOV-2001
DEFINITION  Sequence 5 from Patent WO0185781.
ACCESSION   AX300783
VERSION     AX300783.1  GI:17382064
KEYWORDS
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.D. and Terenghi,G.B.
  TITLE     Repair of nerve damage
  JOURNAL   Patent: WO 0185781-A 5 15-NOV-2001;
            University College London (GB); East Grinstead Medical Research
            Trust (GB)
FEATURES             Location/Qualifiers
     source          1..523
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..336
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD13042.1"
                     /db_xref="GI:17382065"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRR
                     KGSTFEEHK"
BASE COUNT      154 a    129 c    142 g     98 t
ORIGIN

  Query Match             90.4%;  Score 467.4;  DB 6;  Length 523;
  Best Local Similarity   96.2%;  Pred. No. 4.4e-139;
  Matches  501;  Conservative    0;  Mismatches   16;  Indels    4;  Gaps    2;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
              ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy        477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||||||
Db        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521



RESULT 5
AX147754
LOCUS       AX147754                 471 bp    DNA     linear   PAT 08-JUN-2001
DEFINITION  Sequence 13 from Patent WO0136483.
ACCESSION   AX147754
VERSION     AX147754.1  GI:14348552
KEYWORDS
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.R. and Johnson,I.R.
  TITLE     Use of the insulin-like-growth factor i isoform mgf for the
            treatment of neurological disorders
  JOURNAL   Patent: WO 0136483-A 13 25-MAY-2001;
            University College London (GB)
FEATURES             Location/Qualifiers
     source          1..471
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..318
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAC41264.1"
                     /db_xref="GI:14348553"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKEVHLKNTSRGSAGNKN
                     YRM"
BASE COUNT      132 a    118 c    131 g     90 t
ORIGIN

  Query Match             73.0%;  Score 377.2;  DB 6;  Length 471;
  Best Local Similarity   87.8%;  Pred. No. 5.4e-110;
  Matches  455;  Conservative    0;  Mismatches   13;  Indels   50;  Gaps    2;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||| |||
Db        241 ATGCCCAAGACTCAG--------------------------------------------- 255

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
                  ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        256 ----AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 311

Qy        361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
              |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        312 GATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 371

Qy        420 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 479
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        372 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 431

Qy        480 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              ||||||||||||||||||||||||||||||||||||||
Db        432 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 469



RESULT 6
AX300791
LOCUS       AX300791                 471 bp    DNA     linear   PAT 30-NOV-2001
DEFINITION  Sequence 13 from Patent WO0185781.
ACCESSION   AX300791
VERSION     AX300791.1  GI:17382072
KEYWORDS
SOURCE      Oryctolagus cuniculus (rabbit)
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
REFERENCE   1
  AUTHORS   Goldspink,G.D. and Terenghi,G.B.
  TITLE     Repair of nerve damage
  JOURNAL   Patent: WO 0185781-A 13 15-NOV-2001;
            University College London (GB); East Grinstead Medical Research
            Trust (GB)
FEATURES             Location/Qualifiers
     source          1..471
                     /organism="Oryctolagus cuniculus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9986"
     CDS             <1..318
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="CAD13045.1"
                     /db_xref="GI:17382073"
                     /translation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV
                     DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKEVHLKNTSRGSAGNKN
                     YRM"
BASE COUNT      132 a    118 c    131 g     90 t
ORIGIN

  Query Match             73.0%;  Score 377.2;  DB 6;  Length 471;
  Best Local Similarity   87.8%;  Pred. No. 5.4e-110;
  Matches  455;  Conservative    0;  Mismatches   13;  Indels   50;  Gaps    2;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||| |||
Db        241 ATGCCCAAGACTCAG--------------------------------------------- 255

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
                  ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        256 ----AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 311

Qy        361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
              |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        312 GATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 371

Qy        420 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 479
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        372 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 431

Qy        480 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              ||||||||||||||||||||||||||||||||||||||
Db        432 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 469



RESULT 7
HSU40870
LOCUS       HSU40870                 444 bp    mRNA    linear   PRI 05-APR-1996
DEFINITION  Human alternatively spliced human insulin-like growth factor-I
            (IGF-I) mRNA, partial cds.
ACCESSION   U40870
VERSION     U40870.1  GI:1100902
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 444)
  AUTHORS   Chew,S.L., Lavender,P., Clark,A.J. and Ross,R.J.
  TITLE     An alternatively spliced human insulin-like growth factor-I
            transcript with hepatic tissue expression that diverts away from
            the mitogenic IBE1 peptide
  JOURNAL   Endocrinology 136 (5), 1939-1944 (1995)
  MEDLINE   95237119
   PUBMED   7720641
REFERENCE   2  (bases 1 to 444)
  AUTHORS   Chew,S.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-NOV-1995) Shern L. Chew, Endocrinology, St
            Bartholomew's Hospital Medical College, West Smithfield, London,
            EC1A 7BE, UK
FEATURES             Location/Qualifiers
     source          1..444
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="pC4"
                     /tissue_type="liver"
     gene            1..444
                     /gene="IGF-I"
     CDS             <1..420
                     /gene="IGF-I"
                     /note="alternatively spliced; previously, exon 5 and 6
                     were thought to be mutually exclusive; this transcript
                     splices from exon 5 into exon 6; the alternatively spliced
                     transcript would continue with exon 5 to the polyA signal"
                     /codon_start=1
                     /product="insulin-like growth factor-I"
                     /protein_id="AAA96152.1"
                     /db_xref="GI:1100903"
                     /translation="LKVKMHTMSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQ
                     FVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPAKSARSV
                     RAQRHTDMPKTQKYQPPSTNKNTKSQRRKGSTFEERK"
     exon            1..6
                     /gene="IGF-I"
                     /number=1
     exon            7..163
                     /gene="IGF-I"
                     /number=3
     exon            164..345
                     /gene="IGF-I"
                     /number=4
     exon            346..394
                     /gene="IGF-I"
                     /number=5
     exon            395..420
                     /gene="IGF-I"
                     /number=6
BASE COUNT      107 a    125 c    125 g     87 t
ORIGIN

  Query Match             68.7%;  Score 355.4;  DB 9;  Length 444;
  Best Local Similarity   99.7%;  Pred. No. 5.7e-103;
  Matches  356;  Conservative    0;  Mismatches    1;  Indels    0;  Gaps    0;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         88 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 147

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        148 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 207

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        208 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 267

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        268 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 327

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        328 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 387

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
              ||||||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db        388 AGGAAAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 444
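
The per-hit statistics above are mutually consistent under a simple scoring model, which can be used to sanity-check any reconstructed alignment. The sketch below assumes +1 per match, a mismatch penalty of 0.6, and a cost of Gapop (10.0) per gap plus Gapext (1.0) per inserted or deleted base. The gap parameters come from the run header; the 0.6 mismatch penalty and the exact gap formula are inferred from the reported numbers and are not stated anywhere in the report. Best Local Similarity is taken as matches divided by alignment columns.

```python
GAP_OPEN = 10.0   # "Gapop" from the run header
GAP_EXT = 1.0     # "Gapext" from the run header
MISMATCH = 0.6    # assumption: inferred from Result 7 (356 matches, 1 mismatch, 355.4)

def sw_score(matches, mismatches, indels, gaps):
    """Alignment score under the assumed IDENTITY_NUC-style model."""
    return matches - MISMATCH * mismatches - GAP_OPEN * gaps - GAP_EXT * indels

def best_local_similarity(matches, mismatches, indels):
    """Percentage of alignment columns that are identical."""
    return round(100.0 * matches / (matches + mismatches + indels), 1)

# (label, matches, mismatches, indels, gaps, reported score, reported similarity)
hits = [
    ("AX147746", 501, 16, 4, 2, 467.4, 96.2),
    ("AX147754", 455, 13, 50, 2, 377.2, 87.8),
    ("HSU40870", 356, 1, 0, 0, 355.4, 99.7),
    ("AI169770", 375, 80, 11, 4, 276.0, 80.5),
]

for label, m, x, i, g, score, sim in hits:
    print(label, round(sw_score(m, x, i, g), 1), score,
          best_local_similarity(m, x, i), sim)
```

All four hits reproduce both the reported score and the reported similarity, which supports the assumed penalties without proving that they are the values GenCore actually uses.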



RESULT 8
HSIGF1A
LOCUS       HSIGF1A                  616 bp    mRNA    linear   PRI 29-NOV-1993
DEFINITION  H.sapiens mRNA for IGF 1a.
ACCESSION   X56773 S61841
VERSION     X56773.1  GI:32989
KEYWORDS    IGF-1 gene.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 616)
  AUTHORS   Sandberg-Nordqvist,A.C., Stahlbom,P.A., Lake,M. and Sara,V.R.
  TITLE     Characterization of two cDNAs encoding insulin-like growth factor 1
            (IGF-1) in the human fetal brain
  JOURNAL   Brain Res. Mol. Brain Res. 12 (1-3), 275-277 (1992)
  MEDLINE   92186627
   PUBMED   1372070
REFERENCE   2  (bases 1 to 616)
  AUTHORS   Sandberg Nordqvist,A.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-NOV-1990) A.C. Sandberg Nordqvist, KAROLINSKA INST'S
            DEPT OF PATHOLOGY, KAROLINSKA HOSPITAL, BOX 605 00, S-104 01
            STOCKHOLM, SWEDEN
REFERENCE   3  (bases 1 to 616)
  AUTHORS   Sandberg-Nordqvist,A.C., Stahlbom,P.A., Reinecke,M., Collins,V.P.,
            von Holst,H. and Sara,V.
  TITLE     Characterization of insulin-like growth factor 1 in human primary
            brain tumors
  JOURNAL   Cancer Res. 53 (11), 2475-2478 (1993)
  MEDLINE   93265440
   PUBMED   8495408
FEATURES             Location/Qualifiers
     source          1..616
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="q22-q24"
                     /tissue_type="brain"
                     /dev_stage="fetal"
     gene            1..462
                     /gene="IGF-1"
     CDS             1..462
                     /gene="IGF-1"
                     /codon_start=1
                     /product="IGF-1a"
                     /protein_id="CAA40092.1"
                     /db_xref="GI:32990"
                     /db_xref="SWISS-PROT:P01343"
                     /translation="MGKISSLPTQLFKCCFCDFLKVKMHTMSSSHLFYLALCLLTFTS
                     SATAGPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDL
                     RRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKEVHLKNASRGSAGNKNYRM"
     mat_peptide     145..354
                     /gene="IGF-1"
                     /product="IGF-1a"
     exon            403..616
                     /note="exon 5"
BASE COUNT      159 a    158 c    160 g    139 t
ORIGIN



Query Match 66.6%; Score 344.2; DB 9; Length 616;
Best Local Similarity 87.3%; Pred. No. 2.5e-99;
Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5;



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  145 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 204

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  205 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 264

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  265 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 324

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  325 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 384

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  385 ATGCCCAAGACCCAG--------------------------------------------- 399

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  400 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 455

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  456 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 515

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  516 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 575

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  576 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 616



RESULT 9
AX375028
LOCUS       AX375028                7260 bp    DNA     linear   PAT 01-MAR-2002
DEFINITION  Sequence 31 from Patent WO0210436.
ACCESSION   AX375028
VERSION     AX375028.1  GI:19169860
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Baak,J. and Mutter,G.L.
  TITLE     Prognostic classification of breast cancer
  JOURNAL   Patent: WO 0210436-A 31 07-FEB-2002;
            THE BRIGHAM AND WOMEN'S HOSPITAL, INC. (US)
            Baak, Jan (US)
FEATURES             Location/Qualifiers
     source          1..7260
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
BASE COUNT     2330 a   1415 c   1240 g   2275 t
ORIGIN



Query Match 66.6%; Score 344.2; DB 6; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 3.6e-99; 

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5; 
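
The two percentages in these per-hit statistics can be reproduced from the counts printed beside them. The sketch below is an inference from the numbers in this report, not a documented GenCore formula: it assumes Query Match is the score divided by the perfect score (517 for this query) and Best Local Similarity is the match count divided by the number of alignment columns (matches + mismatches + indels). The function names are illustrative only.

```python
def query_match_pct(score, perfect_score):
    """Score as a percentage of the perfect (self-match) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches, mismatches, indels):
    """Matching columns as a percentage of all alignment columns.

    Assumption: every mismatch and every inserted/deleted residue
    counts as one alignment column.
    """
    columns = matches + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Values taken from the statistics lines of the hits in this listing.
print(query_match_pct(344.2, 517))             # hits scoring 344.2
print(best_local_similarity_pct(455, 13, 53))  # 455 matches in 521 columns
print(query_match_pct(342.6, 517))             # hits scoring 342.6
print(best_local_similarity_pct(454, 14, 53))  # 454 matches in 521 columns
```

Under these assumptions the four calls give 66.6, 87.3, 66.3 and 87.1, which agrees with the values printed for the hits scoring 344.2 and 342.6.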



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782





RESULT 10
AX411095
LOCUS       AX411095                7260 bp    DNA     linear   PAT 14-JUN-2002
DEFINITION  Sequence 3742 from Patent WO0229103.
ACCESSION   AX411095
VERSION     AX411095.1  GI:21443800
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Alvares,C., Horne,D., Peres-da-Silva,S. and Vockley,J.G.
  TITLE     Gene expression profiles in liver cancer
  JOURNAL   Patent: WO 0229103-A 3742 11-APR-2002;
            GENE LOGIC INC (US)
FEATURES             Location/Qualifiers
     source          1..7260
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /note="EMBL/GenBank Accession No. X57025"
BASE COUNT     2330 a   1415 c   1240 g   2275 t
ORIGIN

Query Match 66.6%; Score 344.2; DB 6; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 3.6e-99; 

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5; 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 11
HSIGFACI
LOCUS       HSIGFACI                7260 bp    mRNA    linear   PRI 17-FEB-1992
DEFINITION  Human IGF-I mRNA for insulin-like growth factor I.
ACCESSION   X57025
VERSION     X57025.1  GI:33007
KEYWORDS    insulin-like growth factor I.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 7260)
  AUTHORS   Steenbergh,P.H., Koonen-Reemst,A.M., Cleutjens,C.B. and
            Sussenbach,J.S.
  TITLE     Complete nucleotide sequence of the high molecular weight human
            IGF-I mRNA
  JOURNAL   Biochem. Biophys. Res. Commun. 175 (2), 507-514 (1991)
  MEDLINE   91207342
   PUBMED   2018498
REFERENCE   2  (bases 1 to 7260)
  AUTHORS   Steenbergh,P.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-DEC-1990) P.H. Steenbergh, LAB FOR PHYSIOLOGICAL
            CHEMISTRY, UNIVERSITY OF UTRECHT, VONDELLAAN 24 A, 3521 GG UTRECHT,
            THE NETHERLANDS
FEATURES             Location/Qualifiers
     source          1..7260
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="12 q22-24.1"
                     /tissue_type="liver"
                     /dev_stage="adult"
     gene            1..7260
                     /gene="IGF-I"
     mRNA            1..7260
                     /gene="IGF-I"
                     /evidence=experimental
     exon            1..229
                     /gene="IGF-I"
                     /number=1
                     /evidence=experimental
     CDS             167..628
                     /gene="IGF-I"
                     /codon_start=1
                     /evidence=experimental
                     /product="insulin-like growth factor I"
                     /protein_id="CAA40342.1"
                     /db_xref="GI:33008"
                     /db_xref="SWISS-PROT:P01343"
                     /translation="MGKISSLPTQLFKCCFCDFLKVKMHTMSSSHLFYLALCLLTFTS
                     SATAGPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDL
                     RRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKEVHLKNASRGSAGNKNYRM"
     sig_peptide     167..310
                     /gene="IGF-I"
                     /evidence=experimental
     mat_peptide     311..520
                     /gene="IGF-I"
                     /product="insulin-like growth factor I"
                     /evidence=experimental
     exon            230..386
                     /gene="IGF-I"
                     /number=2
                     /evidence=experimental
     exon            387..568
                     /gene="IGF-I"
                     /number=3
                     /evidence=experimental
     exon            569..7236
                     /gene="IGF-I"
                     /number=5
                     /evidence=experimental
     polyA_signal    861..865
                     /gene="IGF-I"
                     /note="1.1 kb mRNA"
                     /evidence=experimental
     repeat_unit     3986..4026
                     /gene="IGF-I"
                     /note="CA-repeat"
                     /evidence=experimental
     repeat_unit     5926..6215
                     /gene="IGF-I"
                     /evidence=experimental
                     /rpt_family="AluI"
     polyA_signal    7205..7210
                     /gene="IGF-I"
                     /note="7.6 kb mRNA"
                     /evidence=experimental
BASE COUNT     2330 a   1415 c   1240 g   2275 t
ORIGIN



Query Match 66.6%; Score 344.2; DB 9; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 3.6e-99; 

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5;



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 12
A29119
LOCUS       A29119                   666 bp    DNA     linear   PAT 15-JUN-1995
DEFINITION  H.sapiens IGF1 gene fragment from patent GB2241703.
ACCESSION   A29119
VERSION     A29119.1  GI:1247520
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 666)
  AUTHORS
  JOURNAL   Patent: GB 2241703-A 3 11-SEP-1991;
FEATURES             Location/Qualifiers
     source          1..666
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
     CDS             25..384
                     /partial
                     /codon_start=1
                     /product="IGF-1 precursor"
                     /protein_id="CAA01955.1"
                     /db_xref="GI:4529932"
                     /translation="MALCLLTFTSSATAGPETLCGAELVDALQFVCGDRGFYFNKPTG
                     YGSSSRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKEV
                     HLKNASRGSAGNKNYRM"
     mat_peptide     67..276
                     /product="IGF-1"
BASE COUNT      173 a    167 c    181 g    145 t
ORIGIN



Query Match 66.3%; Score 342.6; DB 6; Length 666;
Best Local Similarity 87.1%; Pred. No. 8.2e-99;
Matches 454; Conservative 0; Mismatches 14; Indels 53; Gaps 5;



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   67 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 126

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  127 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 186

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  187 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 246

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  247 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 306

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  307 ATGCCCAAGACCCAG--------------------------------------------- 321

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  322 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 377

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  378 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 437

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  438 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 497

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  498 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 538



RESULT 13
HSIGFI
LOCUS       HSIGFI                   725 bp    mRNA    linear   PRI 11-DEC-1998
DEFINITION  Homo sapiens mRNA for insulin-like growth factor 1A precursor,
            complete CDS.
ACCESSION   X00173
VERSION     X00173.1  GI:33015
KEYWORDS    growth factor; insulin super family; insulin-like growth factor I;
            signal peptide; somatomedin.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Jansen,M., van Schaik,F.M., Ricker,A.T., Bullock,B., Woods,D.E.,
            Gabbay,K.H., Nussbaum,A.L., Sussenbach,J.S. and Van den Brande,J.L.
  TITLE     Sequence of cDNA encoding human insulin-like growth factor I
            precursor
  JOURNAL   Nature 306 (5943), 609-611 (1983)
  MEDLINE   84068210
   PUBMED   6358902
COMMENT     Data kindly reviewed (28-MAY-1984) by M. Jansen.
FEATURES             Location/Qualifiers
     source          1..725
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     gene            1..725
                     /gene="IGF-1"
     CDS             12..473
                     /gene="IGF-1"
                     /codon_start=1
                     /product="insulin-like growth factor 1A precursor"
                     /protein_id="CAA24998.1"
                     /db_xref="GI:33016"
                     /db_xref="SWISS-PROT:P01343"
                     /translation="MGKISSLPTQLFKCCFCDFLKVKMHTMSSSHLFYLALCLLTFTS
                     SATAGPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDL
                     RRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKEVHLKNASRGSAGNKNYRM"
     polyA_site      725
                     /gene="IGF-1"
BASE COUNT      190 a    174 c    183 g    178 t
ORIGIN



Query Match 66.3%; Score 342.6; DB 9; Length 725;
Best Local Similarity 87.1%; Pred. No. 8.3e-99;
Matches 454; Conservative 0; Mismatches 14; Indels 53; Gaps 5;

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  156 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 215

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  216 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 275

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  276 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 335

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  336 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 395

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  396 ATGCCCAAGACCCAG--------------------------------------------- 410

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  411 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 466

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  467 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 526

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  527 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 586

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  587 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 627



RESULT 14
HUMGFII
LOCUS       HUMGFII                  728 bp    mRNA    linear   PRI 08-NOV-1994
DEFINITION  Human insulin-like growth factor mRNA, complete cds.
ACCESSION   M29644
VERSION     M29644.1  GI:183119
KEYWORDS    insulin-like growth factor.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 728)
  AUTHORS   Rall,L.B., Scott,J. and Bell,G.I.
  TITLE     Human insulin-like growth factor I and II messenger RNA: isolation
            of complementary DNA and analysis of expression
  JOURNAL   Meth. Enzymol. 146, 239-248 (1987)
  MEDLINE   88065102
   PUBMED   3683205
COMMENT     Original source text: Human (adult) liver, cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..728
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="12q23"
     gene            1..728
                     /gene="IGF1"
     CDS             81..473
                     /gene="IGF1"
                     /note="insulin-like growth factor I precursor"
                     /codon_start=1
                     /protein_id="AAA52543.1"
                     /db_xref="GI:183120"
                     /db_xref="GDB:G00-120-081"
                     /translation="MHTMSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQFVCG
                     DRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPAKSARSVRAQR
                     HTDMPKTQKEVHLKNASRGSAGNKNYRM"
     sig_peptide     81..155
                     /gene="IGF1"
                     /note="insulin-like growth factor I signal peptide"
     mat_peptide     156..365
                     /gene="IGF1"
                     /product="insulin-like growth factor I"
BASE COUNT      193 a    174 c    183 g    178 t
ORIGIN      Chromosome 12q23.



Query Match 66.3%; Score 342.6; DB 9; Length 728;
Best Local Similarity 87.1%; Pred. No. 8.3e-99;
Matches 454; Conservative 0; Mismatches 14; Indels 53; Gaps 5;

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  156 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 215

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  216 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 275

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  276 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 335

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  336 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 395

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  396 ATGCCCAAGACCCAG--------------------------------------------- 410

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  411 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 466

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  467 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 526

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  527 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 586

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  587 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 627





RESULT 15
HUMIGFI
LOCUS       HUMIGFI                 1076 bp    mRNA    linear   PRI 08-NOV-1994
DEFINITION  Human insulin-like growth factor mRNA, complete cds.
ACCESSION   M27544
VERSION     M27544.1  GI:184829
KEYWORDS    insulin-like growth factor.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1076)
  AUTHORS   Le Bouc,Y., Dreyer,D., Jaeger,F., Binoux,M. and Sondermeyer,P.
  TITLE     Complete characterization of the human IGF-I nucleotide sequence
            isolated from a newly constructed adult liver cDNA library
  JOURNAL   FEBS Lett. 196 (1), 108-112 (1986)
  MEDLINE   86108910
   PUBMED   2935423
COMMENT     Original source text: Human liver, cDNA to mRNA, clones
            lambda-TG[03,04,05].
FEATURES             Location/Qualifiers
     source          1..1076
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="7p13-p12"
     gene            1..1076
                     /gene="IGFBP1"
     mRNA            <1..>1076
                     /gene="IGFBP1"
                     /note="IGF mRNA (alt.)"
     mRNA            <1..989
                     /gene="IGFBP1"
                     /note="IGF mRNA (alt.)"
     CDS             149..610
                     /gene="IGFBP1"
                     /note="insulin-like growth factor precursor"
                     /codon_start=1
                     /protein_id="AAA52787.1"
                     /db_xref="GI:306927"
                     /db_xref="GDB:G00-120-075"
                     /translation="MGKISSLPTQLFKCCFCDFLKVKMHTMSSSHLFYLALCLLTFTS
                     SATAGPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDL
                     RRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKEVHLKNASRGSAGNKNYRM"
     sig_peptide     149..292
                     /gene="IGFBP1"
                     /note="insulin-like growth factor signal peptide"
     mat_peptide     293..502
                     /gene="IGFBP1"
                     /product="insulin-like growth factor"
BASE COUNT      283 a    251 c    239 g    303 t
ORIGIN      Chromosome 7p13-p12.



Query Match 66.3%; Score 342.6; DB 9; Length 1076;
Best Local Similarity 87.1%; Pred. No. 8.8e-99;
Matches 454; Conservative 0; Mismatches 14; Indels 53; Gaps 5;



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 6 0 

i f f i i f i i i i f i i i r i f i i f i r i i i i i i i i i i i ri i i f i i i i i i i i i i i i i [ j i i i i i i r 

2 93 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 3 52 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 12 0 

IN Mllllllll MINIMI MINIMI Mill Mill I MINIUM III Mill 

353 AGGGGCTTTTATTTCAA(^GCCC^(^GGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 412 
121 ACAGG CAT CGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTAAGGAGGCTGGAGATGTAT 18 0 

N II -1 1 1 1 1 1 1 1 Mllllllll II I Mill 1 1 1 IN IN 1 1 1 INN INN IIIINN 

413 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 4 72 
181 TGCGC^CCCCTC^GCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCC^GCGCCAC^CCGAC 24 0 

1 1 1 1 1 1 1 1 1 1 1 ! M 1 1 M 1 1 ! II 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 II I M M 1 1 1 1 1 1 1 1 1 1 

473 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCOVGCGCCACACCGAC 532 
241 ATGCCCAAGACCC^GAAGTATCAGCCCCCATCTACCAACAAGAACA^ 3 00 

533 ATGCCCAAGACCCAG 547 

301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAAC^^ 360 

m 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 i f 1 1 j 1 1 1 f 1 1 1 1 1 1 1 r r f i f 1 1 f r i r j i f i r r i 

54 8 AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTACAG 603 

361 GATGTA - GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419 

MINI NINIIII 1 1 II 1 1 1 1 1 1 1 M I INI 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 f I F I 

604 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 663 



QY 420 TGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 4 77 

cc Mill Ml MM II Mill Ml Ml IN MM III I MUM III I llllll 

Db 664 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 723 

Qy 478 AAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 

M M 1 1 III I M 1 1 II I M I M I M 1 1 M I II I M M I 

Db 724 AAAGATGGG CGTTTCCC CCAATGAAATA C A PA AHT A A A PTT 7£4 



Search completed: December 13, 2003, 09:27:32 
Job time : 2313.97 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         December 13, 2003, 02:35:18 ; Search time 207.586 Seconds
                (without alignments)
                6723.048 Million cell updates/sec

Title:          US-09-852-261-1
Perfect score:  517
Sequence:       1 ggaccggagacgctctgcgg..........tgaaatacacaagtaaacat 517

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       2552756 seqs, 1349719017 residues

Total number of hits satisfying chosen parameters:      5105512

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
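
The header above names a Smith-Waterman ("sw") model with an identity scoring table, a gap-open value of 10.0 and a gap-extension value of 1.0. The following is a minimal sketch of an affine-gap local alignment score in Python, not GenCore's implementation. Several values are assumptions: the report does not state the mismatch score or the exact gap convention. The defaults used here (match 1.0, mismatch -0.6, a gap of k residues costing gap_open + k * gap_ext) are inferred only because they reproduce the reported figures, for example 455 - 13 x 0.6 - (5 x 10 + 53 x 1) = 344.2 and 454 - 14 x 0.6 - 103 = 342.6. The function name is hypothetical.

```python
def sw_local_score(query, target, match=1.0, mismatch=-0.6,
                   gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gaps (Gotoh-style recurrence).

    Assumed convention (not confirmed by the report): a gap of k residues
    costs gap_open + k * gap_ext. Comparison is case-sensitive.
    """
    neg_inf = float("-inf")
    cols = len(target) + 1
    h_prev = [0.0] * cols      # best scores for the previous query residue
    f = [neg_inf] * cols       # alignments ending with a gap in the target
    best = 0.0
    for q in query:
        h_cur = [0.0] * cols
        e = neg_inf            # alignments ending with a gap in the query
        for j in range(1, cols):
            # Extend an existing gap or open a new one, whichever is better.
            e = max(e - gap_ext, h_cur[j - 1] - gap_open - gap_ext)
            f[j] = max(f[j] - gap_ext, h_prev[j] - gap_open - gap_ext)
            sub = match if q == target[j - 1] else mismatch
            # Local alignment: never let a cell drop below zero.
            h = max(0.0, h_prev[j - 1] + sub, e, f[j])
            h_cur[j] = h
            best = max(best, h)
        h_prev = h_cur
    return best


# Two 20-base blocks that match exactly, separated in the target by one
# extra base, so the best alignment needs a single one-residue gap.
left, right = "ACGT" * 5, "GATC" * 5
print(sw_local_score(left + right, left + "T" + right))
```

The sketch returns only the score and keeps two rows of the matrix in memory; recovering the Qy/Db alignment itself would additionally require a traceback.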



Database :      N_Geneseq_19Jun03:*
                 1:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
                 2:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
                 3:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
                 4:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
                 5:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
                 6:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
                 7:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
                 8:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
                 9:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
                10:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
                11:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
                12:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
                13:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
                14:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
                15:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
                16:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
                17:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
                18:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
                19:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
                20:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
                21:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
                22:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
                23:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
                24:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*
                25:  /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2003.DAT:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



                              SUMMARIES

                      %
Result              Query
   No.     Score    Match  Length   DB  ID         Description
     1     517      100.0     517   22  AAD06398   Human IGF-I isofor
     2     517      100.0     517   24  AAS16877   Human mechano-grow
     3     467.4     90.4     523   22  AAD06400   Rabbit IGF-I isofo
     4     467.4     90.4     523   24  AAS16879   Rabbit mechano-gro
     5     467.4     90.4     553   18  AAT84893   Rabbit insulin lik
     6     377.2     73.0     471   22  AAD06405   Rabbit liver-type
     7     377.2     73.0     471   24  AAS16884   Rabbit insulin-lik
     8     344.2     66.6     818    8  AAN70436   Sequence encoding
     9     344.2     66.6    7260   24  ABT11091   Human breast cance
    10     344.2     66.6    7260   24  ABK84583   Human cDNA differe
    11     344.2     66.6    7260   24  ABN97244   Gene #3742 used to
    12     344.2     66.6    7260   24  ABK64812   Human benign prost
    13     344.2     66.6    7260   24  ABK35504   Human endometrial
    14     344.2     66.6    7260   24  ABK35561   Gene IGF1 differen
    15     342.6     66.3     777   18  AAT84894   Human insulin like
    16     339.4     65.6     622    7  AAN60490   Human prepro-somat
    17     325.2     62.9     539   22  AAD06399   Rat IGF-I isoform
    18     325.2     62.9     539   24  AAS16878   Rat mechano-growth
    19     318.2     61.5     651   25  ABV76185   Mouse insulin-like
    20     308.6     59.7    1136    8  AAN70435   Sequence encoding
    21     286.4     55.4    3599   19  AAV50428   Plasmid pIG0552 lo
    22     286.4     55.4    3599   19  AAV40796   Actual sequence of
    23     286.4     55.4    3600   19  AAV50427   Plasmid pIG0552 up
    24     286.4     55.4    3600   19  AAV40795   Expected sequence
    25     286.4     55.4    5707   20  AAX88055   Plasmid pIG0335 DN
    26     286.4     55.4    6345   20  AAX88054   Plasmid pIG0100A D
    27     285.4     55.2     612   22  AAS14695   Human cDNA encodin
    28     285.4     55.2     612   25  ABZ83309   Toxicologically re
    29     271.2     52.5     978   14  AAQ47804   Sequence encoding
    30     258.4     50.0     317   24  AAS16882   Human insulin-like
    31     258.4     50.0     318   22  AAD06403   Human liver-type I
    32     258.4     50.0     462   19  AAV50426   Human IGF-1 encodi
    33     258.4     50.0     462   19  AAV40794   Human IGF-I coding
    34     258.4     50.0     462   24  ABZ35734   Human IGF1 polynuc
    35     258.4     50.0     462   24  ABX09977   Human IGF1 DNA fra
    36     258.4     50.0     462   24  ABV78158   Human IGF1 DNA SEQ
    37     258.4     50.0     462   24  ABL91699   Human polynucleoti
    38     252.6     48.9    1052   20  AAX27498   Rat liver form of
    39     247.8     47.9     487   22  AAD06404   Rat liver-type IGF
    40     247.8     47.9     487   24  AAS16883   Rat insulin-like g
    41     234.2     45.3     671   24  ABT09479   Phase-1 Rat CT gen
    42     210       40.6     210   24  AAD45568   Human insulin-like
    43     210       40.6     210   24  AAD44955   Human insulin grow


44 


210 


40 . 


6 


210 


24 


ABA0314 6 


Native mature IGF- 


45 


208 . 


4 


40. 


3 


237 


12 


AAQ13568 


Beta-gal/lGF-1 fus 



ALIGNMENTS 



RESULT 1 
AAD06398 

ID AAD06398 standard; cDNA; 517 BP. 
XX 

AC AAD06398; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human IGF-I isoform mechano-growth factor (MGF) cDNA.
XX
KW Human; IGF-I isoform; Insulin-like Growth Factor-I; MGF;
KW mechano-growth factor; neurological disorder; neurodegenerative disorder;
KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy;
KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder;
KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy;
KW sex-linked muscular dystrophy; peripheral neuropathy;
KW Alzheimer's disease; Parkinson's disease; ss.

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers
FT CDS 1..333
FT /*tag= a
FT /product= "Mechano-growth factor (MGF)"
FT /note= "This region comprises exons 3-6. The CDS does
FT not include start codon"
FT /partial

XX 

PN WO200136483-A1.
XX
PD 25-MAY-2001.
XX
PF 15-NOV-2000; 2000WO-GB04354.
XX
PR 15-NOV-1999; 99GB-0026968.
XX
PA (UNLO ) UNIV COLLEGE LONDON.
XX
PI Goldspink G, Johnson I;
XX
DR WPI; 2001-355620/37.
DR P-PSDB; AAE02447.

XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth
PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a
PT medicament for the treatment of neurological disorder -
XX
PS Claim 4; Page 49-50; 66pp; English.
XX
CC The present invention relates to use of mechano-growth factor (MGF),

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder,
CC including a disorder of motoneurones and/or neurodegenerative disorder,
CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive
CC spinal muscular atrophy, infantile or juvenile muscular atrophy,
CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a
CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an
CC injury that affects motoneurones, motoneurone loss associated with aging,
CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy,
CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease.
CC The present sequence is human IGF-I isoform MGF cDNA. MGF is a muscle

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 

XX 

SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 



Query Match 100.0%; Score 517; DB 22; Length 517; 

Best Local Similarity 100.0%; Pred. No. 1.6e-146;
Matches 517; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

Qy        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420

Qy        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480

Qy        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||
Db        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517





RESULT 2 
AAS16877 

ID AAS16877 standard; cDNA; 517 BP. 
XX 

AC AAS16877; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human mechano-growth factor (MGF) cDNA.
XX
KW Human; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF;
KW neuroprotective; nerve damage; peripheral nervous system; nerve severing;
KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss;
KW nerve avulsion.
XX
OS Homo sapiens.
XX 

FH Key Location/Qualifiers
FT CDS 1..333
FT /*tag= a
FT /product= "Human MGF"
FT /partial
FT /note= "No start codon"
FT exon 1..76
FT /*tag= b
FT /number= 3
FT exon 77..259
FT /*tag= c
FT /number= 4
FT exon 260..307
FT /*tag= d
FT /number= 5
FT exon 308..330
FT /*tag= e
FT /number= 6

XX 

PN WO200185781-A2.
XX
PD 15-NOV-2001.
XX
PF 10-MAY-2001; 2001WO-GB02054.
XX
PR 10-MAY-2000; 2000GB-0011278.
XX
PA (UNLO ) UNIV COLLEGE LONDON.
PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST.

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 
DR P-PSDB; AAU10559. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as
PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has
PT ability to reduce motoneuron loss in response to nerve avulsion, to
PT treat nerve damage -

XX 



PS Claim 11; Fig 5; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I
CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture
CC of a medicament for treating nerve damage in the peripheral nervous
CC system, or for treating nerve damage by localising MGF at the site of
CC damage. The nerve damage may include severing of a nerve. The treatment
CC may be combined with another treatment (such as a polypeptide growth
CC factor other than MGF) that prevents or diminishes degeneration of the
CC target organ (for example, muscle) which the damaged nerve innervates,
CC whereby the treatment of the muscle with MGF or a polynucleotide encoding
CC MGF prevents or diminishes degeneration. The method is useful for
CC treating neurological disorders, preferably motorneuron disorders. These
CC methods can reduce motoneuron loss by 20% or greater in response to nerve
CC avulsion. This sequence represents cDNA encoding the human MGF.

XX 

SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 



Query Match 100.0%; Score 517; DB 24; Length 517; 

Best Local Similarity 100.0%; Pred. No. 1.6e-146; 

Matches 517; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

Qy        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420

Qy        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480

Qy        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||
Db        481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517





RESULT 3 
AAD06400 

ID AAD06400 standard; cDNA; 523 BP.
XX
AC AAD06400;
XX
DT 10-AUG-2001 (first entry)
XX
DE Rabbit IGF-I isoform mechano-growth factor (MGF) cDNA.
XX
KW Rabbit; IGF-I isoform; Insulin-like Growth Factor-I; MGF;
KW mechano-growth factor; neurological disorder; neurodegenerative disorder;

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Oryctolagus cuniculus. 
XX 

FH Key Location/Qualifiers 
FT CDS 1..336 
FT /*tag= a 

FT /product= "Mechano-growth factor (MGF)"
FT /note= "This region comprises exons 3-6. The CDS does

FT not include start codon" 

FT /partial 

XX 

PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354.
XX
PR 15-NOV-1999; 99GB-0026968.
XX
PA (UNLO ) UNIV COLLEGE LONDON.
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 
DR P-PSDB; AAE02449. 
XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth
PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a
PT medicament for the treatment of neurological disorder -
XX
PS Claim 4; Page 53-54; 66pp; English.
XX
CC The present invention relates to use of mechano-growth factor (MGF),
CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a
CC medicament for the treatment of neurological disorder. The MGF is capable
CC of reducing motoneurone loss by 20% or greater in response to nerve
CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 
CC rescue. The MGF polynucleotide and polypeptide are useful in the 



CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy,
CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease.
CC The present sequence is rabbit IGF-I isoform MGF cDNA. MGF is a muscle
CC isoform having extracellular (Ec) domain, hence also referred as
CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by
CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame
CC of MGF.

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 

Query Match 90.4%; Score 467.4; DB 22; Length 523; 

Best Local Similarity 96.2%; Pred. No. 1.8e-131; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2;



Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
              ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy        477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||||||
Db        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521
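
The summary figures reported for this hit can be cross-checked from its alignment counts. The following is a minimal Python sketch, not part of the search output. It assumes a scoring scheme of +1 per match, -0.6 per mismatch, and a gap cost of 10.0 plus 1.0 per gapped position; the gap values follow the Gapop and Gapext settings in the report header, but the mismatch penalty and the exact gap formula are inferred from the listed scores and are not stated anywhere in the report. The function names are hypothetical.

```python
# Illustrative cross-check of the figures reported for RESULT 3 (AAD06400).
# Assumed scoring (inferred from the listed scores, not stated in the report):
# +1 per match, -0.6 per mismatch, and 10.0 + 1.0 * length per gap.

PERFECT_SCORE = 517  # perfect (self-match) score of the 517-base query


def query_match_pct(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches, mismatches, indels):
    """Matches as a percentage of all aligned columns."""
    return 100.0 * matches / (matches + mismatches + indels)


def estimated_score(matches, mismatches, gap_lengths,
                    mismatch_penalty=0.6, gap_open=10.0, gap_ext=1.0):
    """Rebuild the score from alignment counts under the assumed scheme."""
    gap_cost = sum(gap_open + gap_ext * length for length in gap_lengths)
    return matches - mismatch_penalty * mismatches - gap_cost


if __name__ == "__main__":
    # RESULT 3: 501 matches, 16 mismatches, two gaps of lengths 3 and 1.
    print(round(query_match_pct(467.4), 1))                 # Query Match
    print(round(best_local_similarity_pct(501, 16, 4), 1))  # Best Local Similarity
    print(round(estimated_score(501, 16, [3, 1]), 1))       # Score
```

Under these assumptions the three values agree with the reported Query Match of 90.4%, Best Local Similarity of 96.2%, and Score of 467.4. The same scheme also reproduces the score of 377.2 listed for RESULT 6 (455 matches, 13 mismatches, gaps of lengths 49 and 1), which is the only support offered here for the inferred mismatch penalty.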





RESULT 4 
AAS16879 

ID AAS16879 standard; cDNA; 523 BP. 
XX 

AC AAS16879; 

XX
DT 25-FEB-2002 (first entry)
XX
DE Rabbit mechano-growth factor (MGF) cDNA.
XX
KW Rabbit; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF;
KW neuroprotective; nerve damage; peripheral nervous system; nerve severing;
KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss;
KW nerve avulsion.
XX
OS Oryctolagus cuniculus.



XX 

FH Key Location/Qualifiers
FT CDS 1..336
FT /*tag= a
FT /product= "Rabbit MGF"
FT /partial
FT /note= "No start codon"
FT exon 1..76
FT /*tag= b
FT /number= 3
FT exon 77..259
FT /*tag= c
FT /number= 4
FT exon 260..309
FT /*tag= d
FT /number= 5
FT exon 311..333
FT /*tag= e
FT /number= 6

XX 



PN WO200185781-A2.
XX
PD 15-NOV-2001.
XX
PF 10-MAY-2001; 2001WO-GB02054.
XX
PR 10-MAY-2000; 2000GB-0011278.
XX
PA (UNLO ) UNIV COLLEGE LONDON.

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10561. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage - 



XX 

PS Disclosure; Fig 7; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I
CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 20% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rabbit MGF. 

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 

Query Match 90.4%; Score 467.4; DB 24; Length 523; 

Best Local Similarity 96.2%; Pred. No. 1.8e-131; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2 



Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
              ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db        241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480

Qy        477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||||||
Db        481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521



RESULT 5 
AAT84893 

ID AAT84893 standard; cDNA; 553 BP. 
XX 

AC AAT84893; 
XX 

DT 14-APR-1998 (first entry) 
XX 

DE Rabbit insulin like growth factor 1 encoding cDNA. 
XX 

KW Insulin like growth factor 1; IGF-1; Ec peptide; muscle disorder; 

KW heart; neuromuscular disease; primer; ss. 

XX 

OS Oryctolagus cuniculus.
XX 

FH Key Location/Qualifiers 

FT CDS 1..366
FT /*tag= a
FT /product= "IGF-1"
XX
PN WO9733997-A1.
XX
PD 18-SEP-1997.
XX
PF 11-MAR-1997; 97WO-GB00658.
XX
PR 11-MAR-1996; 96GB-0005124.
XX
PA (UNLO ) ROYAL FREE HOSPITAL SCHOOL MED.
XX 

PI Goldspink G; 
XX 

DR WPI; 1997-470877/43. 

DR P-PSDB; AAW23301. 
XX 

PT Use of insulin like growth factor I characterised by presence of Ec 

PT peptide - to treat humans or animals, particularly muscle disorders, 

PT heart conditions or neuromuscular diseases 
XX 

PS Disclosure; Fig 3; 33pp; English.
XX 

CC A use of insulin like growth factor I (IGF-1) has been developed, and 

CC is characterised by the presence of the Ec peptide, or a functional 

CC equivalent, in the treatment or therapy of a human or animal. The IGF-1 

CC polypeptide can be used to treat muscular disorders, e.g. Duchenne or 

CC Becker muscular dystrophy, autosomal dystrophies and related progressive 

CC skeletal muscle weakness and wasting, muscle atrophy in ageing humans, 

CC spinal cord injury induced muscle atrophy and neuromuscular diseases;

CC and cardiac disorders, e.g. diseases where promotion of cardiac muscle 

CC protein synthesis is a beneficial treatment, cardiomyopathies and acute 

CC heart failure or insult, specifically myocarditis or myocardial 

CC infarction. It can also be used to promote bone fracture healing and 

CC maintenance of bone in old age. The present sequence encodes rabbit 

CC IGF-1 used in the present specification. 



XX 



SQ Sequence 553 BP; 159 A; 142 C; 147 G; 105 T; 0 Other;

Query Match 90.4%; Score 467.4; DB 18; Length 553;
Best Local Similarity 96.2%; Pred. No. 1.8e-131;
Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2;


Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db         31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
              ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db        271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 330

Qy        298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        331 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 390

Qy        358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
              ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        391 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450

Qy        417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        451 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 510

Qy        477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              |||||||||||||||||||||||||||||||||||||||||
Db        511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 551





RESULT 6 
AAD06405 

ID AAD06405 standard; cDNA; 471 BP. 
XX 

AC AAD06405; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rabbit liver-type IGF-I isoform (L.IGF-I) cDNA. 
XX 

KW Rabbit; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano-growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy;
KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder;
KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy;
KW sex-linked muscular dystrophy; peripheral neuropathy;
KW Alzheimer's disease; Parkinson's disease; liver; L.IGF-I; ss.

XX 

OS Oryctolagus cuniculus. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..318
FT /*tag= a
FT /product= "Liver-type IGF-I isoform (L.IGF-I)"
FT /transl_except= (pos:7..9, aa:Gln)
FT /transl_except= (pos:25..27, aa:Gln)
FT /note= "These translation exceptions occur while decoding
FT the alternative version of the protein (AAE02456).

FT The CDS comprises exons 3, 4 and 6 and 

FT does not include start codon" 

FT /partial 

XX 

PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354.
XX
PR 15-NOV-1999; 99GB-0026968.
XX
PA (UNLO ) UNIV COLLEGE LONDON.
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02452, AAE02456. 

XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Disclosure; Page 59-60; 66pp; English.
XX
CC The present invention relates to use of mechano-growth factor (MGF),

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rabbit liver-type IGF-I isoform (L.IGF-I) cDNA. 

CC The L.IGF-I protein comprises amino acid sequences encoded by 



CC nucleic acid sequence of IGF-I exons 4 and 6. 
XX 

SQ Sequence 471 BP; 132 A; 118 C; 131 G; 90 T; 0 other; 

Query Match 73.0%; Score 377.2; DB 22; Length 471; 

Best Local Similarity 87.8%; Pred. No. 3.8e-104; 

Matches 455; Conservative 0; Mismatches 13; Indels 50; Gaps 2;

Qy          1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db          1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy         61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
              ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db         61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
              ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy        181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
              || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db        181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy        241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
              ||||||||||| |||
Db        241 ATGCCCAAGACTCAG--------------------------------------------- 255

Qy        301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
                  ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        256 ----AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 311

Qy        361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
              |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        312 GATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 371

Qy        420 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 479
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        372 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 431

Qy        480 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
              ||||||||||||||||||||||||||||||||||||||
Db        432 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 469



RESULT 7 
AAS16884 

ID AAS16884 standard; cDNA; 471 BP. 
XX 

AC AAS16884;
XX
DT 25-FEB-2002 (first entry)
XX
DE Rabbit insulin-like growth factor I liver-type isoform (L.IGF-I) cDNA.
XX
KW Rabbit; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF;
KW neuroprotective; nerve damage; peripheral nervous system; nerve severing;
KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss;
KW nerve avulsion; insulin-like growth factor I liver-type isoform; L.IGF-I.
XX
OS Oryctolagus cuniculus.
XX 

FH Key Location/Qualifiers 

FT CDS 1..318
FT /*tag= a
FT /product= "Rabbit L.IGF-I"
FT /partial
FT /note= "No start codon"
FT exon 1..75
FT /*tag= b
FT /number= exon 3
FT exon 76..258
FT /*tag= c
FT /number= exon 4
FT exon 259..315
FT /*tag= d
FT /number= exon 6

XX 

PN WO200185781-A2.
XX
PD 15-NOV-2001.
XX
PF 10-MAY-2001; 2001WO-GB02054.
XX
PR 10-MAY-2000; 2000GB-0011278.
XX
PA (UNLO ) UNIV COLLEGE LONDON.
PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST.

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10564. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 10; 65pp; English.
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 20% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rabbit insulin-like 



CC growth factor I liver-type isoform (L.IGF-I) used in experiments on 

CC motoneuron loss. 

XX 

SQ Sequence 471 BP; 132 A; 118 C; 131 G; 90 T; 0 other; 

Query Match 73.0%; Score 377.2; DB 24; Length 471; 

Best Local Similarity 87.8%; Pred. No. 3.8e-104; 

Matches 455; Conservative 0; Mismatches 13; Indels 50; Gaps 2 
Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIIIIMIIIIMMIIII MIIIIIMMIMIIIIMIIIMIIIIIIIIIIIMI 

Db 1 GGACCGGAGACGCTCTGCGGTGGTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

INI Ml III 1 1 1 MM MM IIIIIIII II MIIIIMIIIIIIMMII | Mill 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180 

M II 1 1 1 II M 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M IIMMIIIIIMIMI 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

M 1 1 1 1 1 1 ! 1 1 1 ! 1 1 1 II III MM Mill I ! I r 1 1 1 1 f I j I i 1 1 1 1 1 1 1 J i f I 

Db 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240

Qy 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

Ml Mill: Ml 

Db 241 ATGCCCAAGACTCAG 255 

Qy 301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

M II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II I II 1 1 M I M 1 1 1 1 1 M 1 1 

Db 256 AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 311

Qy 361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419 

Mini 1 1 1 r ( 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 f 1 1 f 1 1 1 1 1 1 1 ( 1 1 r i f 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 

Db 312 GATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 371 

Qy 420 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 479

Ml MM I IMI MM II MINI MM 1 1 III MM MM I Ml I MM MM | IN 1 1 

Db 372 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 431 

Qy 480 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517

I M II 1 1 1 1 1 M 1 1 II II 1 1 1 1 M I M M I M 1 1 M M 

Db 432 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 469 
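
The percentages in each result's summary lines appear to follow two simple ratios, sketched below in Python. These formulas are inferred from the figures printed in this report, not taken from the search tool's documentation, and the function names are hypothetical. The assumption is that Query Match is the score divided by the perfect score (517 for this query) and that Best Local Similarity is the number of matches divided by the number of aligned columns (matches + mismatches + indels).

```python
# Hypothetical helpers; the formulas are assumptions inferred from the
# numbers in this report, not a documented definition.

def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, mismatches: int, indels: int) -> float:
    """Matches as a percentage of all aligned columns."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


if __name__ == "__main__":
    # Figures from the AAS16884 record (Score 377.2; Matches 455;
    # Mismatches 13; Indels 50); perfect score 517 is the query length.
    print(round(query_match_pct(377.2, 517), 1))
    print(round(best_local_similarity_pct(455, 13, 50), 1))
```

Under these assumptions the same two functions also reproduce the 66.6% and 87.3% reported for the records that score 344.2 with 53 indels.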



RESULT 8 
AAN70436 

ID AAN70436 standard; cDNA; 818 BP. 
XX 

AC AAN70436; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-APR-1991 (first entry) 

XX 

DE Sequence encoding insulin-like growth factor 1A (IGF-1A).
XX 



KW Growth promoter; lactation enhancer; cell proliferation; ss. 
XX 

OS Homo sapiens.
XX 

PN EP229750-A. 
XX 

PD 22-JUL-1987. 
XX 

PF 06-JAN-1987; 87EP-0870001.
XX
PR 20-NOV-1986; 86US-0929671.
PR 07-JAN-1986; 86US-0816662.
XX
PA (UNIW ) UNIV WASHINGTON.
XX

PI Krivi GG, Rotwein PS; 
XX 

DR WPI; 1987-200203/29. 
XX 

PT New pre-pro-insulin-like growth factor-1 protein - obtd. by

PT recombinant DNA procedures for use as growth promoters for 

PT enhancing lactation, for stimulating cell proliferation etc. 
XX 

PS Example; Fig 5; 59pp; English. 
XX 

CC A 42 base oligonucleotide corresponding to the DNA sequence encoding
CC amino acids 10 to 23 of mature human IGF-I was synthesized (AAN70437).
CC The radiolabeled 42 mer was then employed to screen for IGF-I
CC containing DNA sequences in a human liver cDNA library. Insulin-
CC like growth factors -1A and -1B cDNAs were isolated from a human cDNA
CC library by using lambdagt11 (AAN70435, AAN70436). The human IGF-1
CC genomic gene was isolated and mapped. It encodes at least two
CC preproinsulin-like growth factor-1 proteins. An essentially pure
CC proproinsulin-like growth factor-1 protein comprising the sequence
CC of amino acids shown in Figure six is claimed (AAP70277).
CC (Updated on 25-MAR-2003 to correct PA field.)
XX 

SQ Sequence 818 BP; 232 A; 186 C; 187 G; 213 T; 0 other; 



Query Match 66.6%; Score 344.2; DB 8; Length 818; 

Best Local Similarity 87.3%; Pred. No. 4.8e-94; 

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5 



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  203 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 262

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  263 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 322

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  323 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 382

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  383 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 442

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  443 ATGCCCAAGACCCAG--------------------------------------------- 457

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  458 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 513

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  514 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 573

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  574 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 633

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  634 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 674



RESULT 9 
ABT11091 

ID ABT11091 standard; cDNA; 7260 BP. 
XX 

AC ABT11091; 
XX 

DT 04-DEC-2002 (first entry) 
XX 

DE Human breast cancer associated coding sequence SEQ ID NO: 1225. 
XX 

KW Human; breast specific gene; breast cancer; differential expression; 

KW cytostatic; gene therapy; gene; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200259271-A2.
XX
PD 01-AUG-2002.
XX
PF 25-JAN-2002; 2002WO-US02176.
XX
PR 25-JAN-2001; 2001US-263757P.
PR 25-APR-2001; 2001US-286090P.
PR 23-MAY-2001; 2001US-292517P.
XX 

PA (GENE-) GENE LOGIC INC. 
XX 

PI Orr MS, Nation M, Diggans JC, Zeng W; 
XX 

DR WPI; 2002-674803/72. 
XX 

PT Diagnosing breast cancer in a patient comprises detecting the level of 
PT gene expression in cell or tissue samples, where a differential gene 
PT expression is indicative of breast cancer 



XX 

PS Claim 1; SEQ ID NO 1225; 260pp + Sequence Listing; English. 
XX 

CC The present invention relates to methods of diagnosing breast cancer in a 

CC patient, which comprise detecting the level of expression in a tissue 

CC sample of two or more genes selected from those shown in ABT09867- 

CC ABT11112, where a differential expression of the genes indicates breast 

CC cancer. The methods are useful in diagnosing, treating, detecting the 

CC progression, and in monitoring treatment of breast cancer in patients. 

CC The methods are also useful as a screening tool for agents that modulate 

CC the onset or progression of breast cancer. The breast cancer genes may be 

CC used as diagnostic markers for the prediction or identification of the 

CC malignant state of breast tissue, for confirming the type and progression 

CC of cancer, and for drug screening and assays. The present sequence is a 

CC coding sequence of the invention. 

CC Note: The sequence data for this patent did not form part of the printed
CC specification, but was obtained in electronic format directly from WIPO
CC at ftp.wipo.int/pub/published_pct_sequences.
XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 



Query Match 66.6%; Score 344.2; DB 24; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.1e-93;

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 10 
ABK84583 

ID ABK84583 standard; cDNA; 7260 BP. 
XX 

AC ABK84583; 
XX 

DT 14-AUG-2002 (first entry) 
XX 

DE Human cDNA differentially expressed in granulocytic cells #1154. 
XX 

KW Human; ss; granulocytic cell; DNA chip; bacterial infection; 

KW viral infection; parasitic infection; protozoal infection; 

KW fungal infection; sterile inflammatory disease; psoriasis; 

KW rheumatoid arthritis; glomerulonephritis; asthma; thrombosis;

KW cardiac reperfusion injury; renal reperfusion injury; ARDS; 

KW adult respiratory distress syndrome; inflammatory bowel disease; 

KW Crohn's disease; ulcerative colitis; periodontal disease; 

KW granulocyte activation; chronic inflammation; allergy. 

XX 

OS Homo sapiens. 
XX 

PN WO200228999-A2.
XX
PD 11-APR-2002.
XX
PF 03-OCT-2001; 2001WO-US30821.
XX 

PR 03-OCT-2000; 2000US-237189P. 
XX 

PA (GENE-) GENE LOGIC INC.
XX 

PI Beazer-Barclay Y, Weissman SM, Yamaga S, Vockley J; 
XX 

DR WPI; 2002-435328/46. 
XX 

PT Detecting granulocyte activation by detecting differential expression 

PT of genes associated with granulocyte activation, which serves as 

PT diagnostic markers that is useful for monitoring disease states and 

PT drug toxicity 
XX 

PS Claim 1; SEQ ID No 1154; 114pp; English.
XX 

CC The invention relates to detecting (M1) granulocyte (GC) activation

CC (GCA), by detecting the level of expression of gene(s) (Gs) identified by

CC DNA chip analysis as given in the specification, and comparing 

CC the expression level to an expression level in an unactivated 

CC GC, where differential expression of Gs is indicative of GCA. 

CC Also included are modulating (M2) GA by contacting GC with an agent 

CC that alters the expression of at least one gene in Gs; (2) screening (M3) 

CC for an agent capable of modulating GCA or an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease using the 



CC gene expression profile; (3) detecting (M4) an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease, by detecting the 

CC level of expression in a sample of the tissue of gene(s) from Gs, where 

CC the level of expression of the gene is indicative of inflammation; 

CC (4) treating (M5) an inflammation (especially chronic) or in a tissue, 

CC an allergic response in a subject, exposure of a subject to a pathogen 

CC or sterile inflammatory disease, by contacting a tissue having 

CC inflammation with an agent that modulates the expression of gene(s) 

CC from Gs in the tissue. M1 is useful for detecting GCA; M2 is useful for

CC modulating GA; M3 is useful for screening an agent capable of modulating 

CC GCA preferably in an inflammation in a tissue; M4 is useful for 

CC detecting an inflammation (especially chronic) in a tissue, an allergic 

CC response in a subject, exposure of a subject to a pathogen or sterile 

CC inflammatory disease (e.g. psoriasis, rheumatoid arthritis, 

CC glomerulonephritis, asthma, thrombosis, cardiac reperfusion injury, renal 

CC reperfusion injury, ARDS, adult respiratory distress syndrome,

CC inflammatory bowel disease, Crohn's disease, ulcerative colitis, 

CC periodontal disease; also bacterial infection, viral infection, 

CC parasitic infection, protozoal infection, fungal infection and M5 is 

CC useful for treating one of the above conditions. The present 

CC sequence represents a gene differentially expressed in granulocytes. 

CC Note: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic 

CC format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences.

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other;

Query Match 66.6%; Score 344.2; DB 24; Length 7260;

Best Local Similarity 87.3%; Pred. No. 1.1e-93;

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5 
Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
Db

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
Db

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
Db

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
Db

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
Db

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
Db




RESULT 11 
ABN97244 

ID ABN97244 standard; DNA; 7260 BP. 
XX 

AC ABN97244; 
XX 

DT 13-AUG-2002 (first entry) 
XX 

DE Gene #3742 used to diagnose liver cancer. 
XX 

KW Gene; liver cancer; ds; hepatocellular carcinoma; hepatotropic;

KW metastatic liver tumour; cytostatic; expression profile; disease state; 

KW disease progression; drug toxicity; drug efficacy; drug metabolism. 

XX 

OS Homo sapiens. 
XX 

PN WO200229103-A2.
XX
PD 11-APR-2002.
XX
PF 02-OCT-2001; 2001WO-US30589.
XX
PR 02-OCT-2000; 2000US-237054P.
XX
PA (GENE-) GENE LOGIC INC.
XX 

PI Home D, Alvares C, Peres-Da-Silva S, Vockley JG; 
XX 

DR WPI; 2002-426119/45. 
XX 

PT Diagnosing and detecting the progression of liver cancer, 

PT hepatocellular carcinoma or metastatic liver tumor in a patient, 

PT involves detecting the level of expression of two or more genes in a 

PT liver tissue sample 

XX 

PS Claim 1; SEQ ID NO 3742; 298pp; English.
XX 

CC The invention relates to a novel method for diagnosing and detecting the 
CC progression of liver cancer, hepatocellular carcinoma or metastatic liver 
CC tumour in a patient, and differentiating metastatic liver cancer from 
CC hepatocellular carcinoma in a patient, involving detecting the level of 
CC expression of two or more genes represented in ABN93503-ABN97455 in a 
CC tissue sample. The method of the invention has hepatotropic, and 
CC cytostatic activity. The method is useful for diagnosing and detecting 



CC the progression of liver cancer, hepatocellular carcinoma and metastatic 

CC liver carcinoma in a patient. The method is useful for identifying 

CC expression profiles which serve as useful diagnostic markers as well as 

CC markers that can be used to monitor disease states, disease progression, 

CC drug toxicity, drug efficacy and drug metabolism. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 
XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 

Query Match 66.6%; Score 344.2; DB 24; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.1e-93;

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5
Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 12 
ABK64812

ID ABK64812 standard; DNA; 7260 BP. 
XX 

AC ABK64812;



XX 

DT 18-JUN-2002 (first entry) 
XX 

DE Human benign prostatic hyperplasia gene #707. 
XX 

KW Human; benign prostatic hyperplasia; BPH; prostate cancer; gene; ds.
XX 

OS Homo sapiens. 
XX 

PN WO200212440-A2.
XX
PD 14-FEB-2002.
XX
PF 07-AUG-2001; 2001WO-US24708.
XX
PR 07-AUG-2000; 2000US-223323P.
PR 05-JUN-2001; 2001US-0873319.
XX
PA (GENE-) GENE LOGIC INC.
PA (NISB ) JAPAN TOBACCO INC.
XX 

PI Munger WE, Kulkarni P, Getzenberg RH, Waga I, Yamamoto J; 
XX 

DR WPI; 2002-257476/30. 
XX 

PT Identifying drugs for and diagnosing benign prostatic hyperplasia, by 

PT detecting expression levels of one or more genes in prostate cells from 

PT patient that are differentially regulated compared to normal prostate 

PT cells - 
XX 

PS Disclosure; Page 391-393; 444pp; English.
XX 

CC The invention relates to a method of diagnosing (I) the onset or 

CC progression of benign prostatic hyperplasia (BPH), or screening (II) for 

CC or identifying an agent that modulates the onset or progression of BPH. 

CC The method is based on changes in gene expression in BPH tissue isolated 

CC from patients exhibiting different clinical states of prostate 

CC hyperplasia as compared to normal prostate tissue. (I) comprises 

CC detecting the expression levels of one or more genes in prostate cells 

CC from the subject that are differentially regulated compared to normal 

CC prostate cells. (II) comprises preparing a first gene expression profile 

CC of BPH cells or BPH-like cell population, exposing the cells to the 

CC agent, preparing a second gene expression profile of the agent exposed 

CC cells, and comparing the first and second gene expression profiles. 

CC (I) is useful for diagnosing the onset or progression of BPH. (II) is

CC useful for identifying an agent that modulates the onset or progression 

CC of BPH. The methods are useful to present information identifying 

CC the expression level in a tissue or cells, by comparing the expression 

CC level of genes given in the specification in the tissue or cells to the 

CC level of expression of gene in the database, and displaying the 

CC expression levels of at least one gene in the tissue or cell sample 

CC compared to the expression level in BPH. Agents using (II) are useful for 

CC treating BPH or prostate cancer. ABK64106-ABK64860 represent human 

CC benign prostatic hyperplasia gene sequences of the invention. 

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 



Query Match 66.6%; Score 344.2; DB 24; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.1e-93;

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5;



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 13 
ABK35504 

ID ABK35504 standard; DNA; 7260 BP. 
XX 

AC ABK35504;
XX 

DT 08-MAY-2002 (first entry) 
XX 

DE Human endometrial cancer related gene, IGF1. 
XX 

KW Human; ds; gene; endometrial cancer; differential expression;

KW DNA microarray; protein microarray. 

XX 

OS Homo sapiens.
XX
PN WO200209573-A2.
XX
PD 07-FEB-2002.
XX
PF 31-JUL-2001; 2001WO-US24104.
XX
PR 31-JUL-2000; 2000US-221735P.
XX 

PA (BGHM ) BRIGHAM & WOMENS HOSPITAL INC. 
XX 

PI Mutter GL; 
XX 

DR WPI; 2002-179967/23. 

DR P-PSDB; AAU84284. 
XX 

PT Diagnosing endometrial cancer comprises determining expression of 

PT nucleic acid molecules or expression products that are differentially 

PT expressed in normal and malignant endometrium - 

XX 

PS Claim 1; Page 85-89; 233pp; English.
XX 

CC The invention relates to diagnosing endometrial cancer in a subject 

CC suspected of having endometrial cancer comprising determining the 

CC expression of a set of nucleic acid molecules or expression products in 

CC an endometrial sample suspected of being cancerous, where the set of 

CC nucleic acid molecules comprises at least 2 nucleic acid molecules 

CC selected from 50 fully defined sequences as given in the specification. 

CC The nucleic acids are used as an array of at least 2 of the 50 

CC nucleic acids bound to a solid substrate. Also included is a solid-phase 

CC protein microarray comprising at least 2 antibodies or its antigen 

CC binding fragments, that specifically bind at least 2 different 

CC polypeptides from the 50 fully defined sequences as given in the 

CC specification, fixed to a solid substrate. The methods and arrays are 

CC useful for the diagnosis of endometrial cancer, selecting and monitoring 

CC treatment regimes and identification of lead compounds useful for the 

CC treatment of endometrial cancer. The present sequence is one of 50 

CC genes differentially expressed between cancerous and non-cancerous 

CC samples.

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 

Query Match 66.6%; Score 344.2; DB 24; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.1e-93;

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5 
Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 14 
ABK35561 

ID ABK35561 standard; DNA; 7260 BP. 
XX 

AC ABK35561; 
XX 

DT 08-MAY-2002 (first entry) 
XX 

DE Gene IGF1 differentially expressed in breast cancer tissue. 
XX 

KW Human; diagnosis of breast cancer; endometrial cancer; breast tumour; 

KW MAI; mitotic activity index; cytostatic; gene; ds.

XX 

OS Homo sapiens. 
XX 

PN WO200210436-A2. 
XX 

PD 07-FEB-2002. 
XX 

PF 27-JUL-2001; 2001WO-US23642.
XX
PR 28-JUL-2000; 2000US-222093P.
XX
PA (BGHM ) BRIGHAM & WOMENS HOSPITAL INC.
PA (BAAK/) BAAK J.

XX 

PI Baak J, Mutter GL; 
XX 

DR WPI; 2002-180084/23. 
DR P-PSDB; AAU84341. 
XX 

PT Diagnosing breast cancer comprises determining expression of nucleic
PT acid molecules or expression products that are differentially expressed 



PT in normal and malignant tissue - 
XX 

PS Claim 1; Page 74-78; 219pp; English. 
XX 

CC The present invention relates to a method for diagnosing breast cancer 

CC in a subject suspected of having endometrial cancer. The method 

CC comprises determining the expression of a set of human genes or 

CC expression products in an endometrial sample suspected of being 

CC cancerous. The human genes of the invention are differentially 

CC expressed in breast tumours characterised as high or low MAI (mitotic 

CC activity index). These sets of genes can be used to discriminate between

CC high and low MAI breast tumours. The invention also provides DNA and 

CC protein microarrays for analysing the expression of the human genes and 

CC their protein products. The methods and arrays are useful for the 

CC diagnosis and prognosis of endometrial cancer, selecting and monitoring 

CC treatment regimes, and identification of compounds useful for the 

CC treatment of endometrial cancer. ABK35531-ABK35581 represent the human

CC genes of the invention that are differentially expressed in breast

CC cancer tissue. 

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other;

Query Match 66.6%; Score 344.2; DB 24; Length 7260;

Best Local Similarity 87.3%; Pred. No. 1.1e-93;

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5 
Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 15 
AAT84894 

ID AAT84894 standard; cDNA; 777 BP. 
XX 

AC AAT84894; 
XX 

DT 14-APR-1998 (first entry) 
XX 

DE Human insulin like growth factor 1 Ea isoform encoding cDNA. 
XX 

KW Insulin like growth factor 1; IGF-1; Ec peptide; muscle disorder; 

KW heart; neuromuscular disease; ss.

XX 

OS Homo sapiens.
XX 

FH Key Location/Qualifiers 
FT CDS 26..496

FT /*tag= a

FT /product= "IGF-1 Ea isoform"

XX 

PN WO9733997-A1.
XX 

PD 18-SEP-1997. 
XX 

PF 11-MAR-1997; 97WO-GB00658.
XX

PR 11-MAR-1996; 96GB-0005124.
XX 

PA (UNLO ) ROYAL FREE HOSPITAL SCHOOL MED. 
XX 

PI Goldspink G; 
XX 

DR WPI; 1997-470877/43. 
DR P-PSDB; AAW23302. 
XX 

PT Use of insulin like growth factor I characterised by presence of Ec 
PT peptide - to treat humans or animals, particularly muscle disorders, 
PT heart conditions or neuromuscular diseases
XX 

PS Disclosure; Fig 4; 33pp; English. 
XX 

CC A use of insulin like growth factor I (IGF-1) has been developed, and 
CC is characterised by the presence of the Ec peptide, or a functional 
CC equivalent, in the treatment or therapy of a human or animal. The IGF-1 
CC polypeptide can be used to treat muscular disorders, e.g. Duchenne or 
CC Becker muscular dystrophy, autosomal dystrophies and related progressive 
CC skeletal muscle weakness and wasting, muscle atrophy in ageing humans, 
CC spinal cord injury induced muscle atrophy and neuromuscular diseases, 
CC and cardiac disorders, e.g. diseases where promotion of cardiac muscle 
CC protein synthesis is a beneficial treatment, cardiomyopathies and acute
CC heart failure or insult, specifically myocarditis or myocardial 



CC infarction. It can also be used to promote bone fracture healing and 

CC maintenance of bone in old age. The present sequence encodes human 

CC IGF-1 Ea isoform used in the present specification. 
XX 

SQ Sequence 777 BP; 201 A; 193 C; 204 G; 179 T; 0 other; 

Query Match 66.3%; Score 342.6; DB 18; Length 777; 

Best Local Similarity 87.1%; Pred. No. 1.4e-93; 

Matches 454; Conservative 0; Mismatches 14; Indels 53; Gaps 5 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  179 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 238

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  239 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 298

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  299 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 358

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  359 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 418

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  419 ATGCCCAAGACCCAG--------------------------------------------- 433

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  434 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 489

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  490 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 549

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  550 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 609

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  610 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 650
RESULT 1

US-09-142-583A-3 

; Sequence 3, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION: 

APPLICANT: GOLDSPINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible

OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/142,583A
FILING DATE: 29-Oct-1998 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/00658 
FILING DATE: 11-MAR-1997
APPLICATION NUMBER: GB 9605124.8
FILING DATE: 11-MAR-1996
ATTORNEY/AGENT INFORMATION:
NAME: SADOFF, B. J. 
REGISTRATION NUMBER: 36663 
REFERENCE/DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 7038164000 
TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 3:

SEQUENCE CHARACTERISTICS: 

LENGTH: 553 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: both
TOPOLOGY: linear
MOLECULE TYPE: cDNA
FEATURE:

NAME/KEY: CDS
LOCATION: 1..363
SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
US-09-142-583A-3 

Query Match 90.4%; Score 467.4; DB 3; Length 553; 

Best Local Similarity 96.2%; Pred. No. 1.2e-134; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db   31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db   91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db  151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db  211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACC G 297

1 1 1 1 1 1 M II I M II II 1 1 1 II I M II 1 1 II 1 1 M II II II I I M 1 1 1 II M I 

Db  271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAA 330

Qy  298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  331 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 390

Qy  358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416

Db

Qy  417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476

Db

Qy  477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517

Db

RESULT 2 

US-09-142-583A-5 

; Sequence 5, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION: 

APPLICANT: GOLDSPINK, GEOFFREY

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/142,583A

FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/00658 

FILING DATE: 11-MAR-1997

APPLICATION NUMBER: GB 9605124.8

FILING DATE: 11-MAR-1996
ATTORNEY/AGENT INFORMATION:

NAME: SADOFF, B. J. 

REGISTRATION NUMBER: 36663 

REFERENCE/DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7038164000 

TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 553 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: both

TOPOLOGY: linear 



MOLECULE TYPE: cDNA 
FEATURE: 

NAME/KEY: CDS
LOCATION: 341..397
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-142-583A-5 



Query Match 90.4%; Score 467.4; DB 3; Length 553; 

Best Local Similarity 96.2%; Pred. No. 1.2e-134; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db   31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||| || |||||||||||||||||||| ||||||
Db   91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db  151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        || |||||||||||||| || ||| |||| ||||| ||||||||||||||||||||||||
Db  211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297
        ||||||||||| |||||||||||||| ||||||||||||||||| | |||||||||   |
Db  271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 330

Qy  298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  331 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 390

Qy  358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
        ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db  391 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450

Qy  417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  451 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 510

Qy  477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||||||||||||||||||||||||||||||||||||||
Db  511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 551





RESULT 3 

US-09-142-583A-10 

; Sequence 10, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION: 

APPLICANT: GOLDSPINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 



ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/142,583A

FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/00658 

FILING DATE: 11-MAR-1997

APPLICATION NUMBER: GB 9605124.8

FILING DATE: 11-MAR-1996
ATTORNEY/AGENT INFORMATION:

NAME: SADOFF, B. J. 

REGISTRATION NUMBER: 36663 

REFERENCE/DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7038164000 

TELEFAX: 7038164100 
; INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 777 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: both 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA
FEATURE:

NAME/KEY: CDS

LOCATION: 26..493
SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
US-09-142-583A-10 

Query Match 66.3%; Score 342.6; DB 3; Length 777; 

Best Local Similarity 87.1%; Pred. No. 4.4e-96; 

Matches 454; Conservative 0; Mismatches 14; Indels 53; Gaps 5 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Db

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Db

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Db

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  359 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 418

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db  419 ATGCCCAAGACCCAG--------------------------------------------- 433

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db  434 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 489

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db  490 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 549

Qy  420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
        ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db  550 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 609

Qy  478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
        |||||| ||| ||||||||||||||||||||||||||||||
Db  610 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 650



RESULT 4 
5405942-2 
Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
JAMES P.

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984
SEQ ID NO: 2:

LENGTH: 622 
5405942-2 

Query Match 65.6%; Score 339.4; DB 6; Length 622; 

Best Local Similarity 69.7%; Pred. No. 3.9e-95;

Matches 363; Conservative 89; Mismatches 16; Indels 53; Gaps 5;
Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I MM III II II hh 1 1 II MM I MM 11 = I IhlhhMI MM hhhIMIII 

Db 45 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 104

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

1 1 1 MM- M-M I M I II I M I II IMMI IM 1 1 1 1 1 Ml 1 1 II 1 1 1 1 MM I 

Db 105 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCU 164 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Mill M Ml MMMMMMMMMMMMM IMIMMMMMMM 

Db 165 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCAUAGGAGGCUGGAGAUGUAU 224



Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

M! MM MM MM MlMMMMMMMMMMMMMMMMMM 

Db 225 UGCGCACCCCUCAAGCCUGCCAAGUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 284 

Qy 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

hi MM MM 

Db 285 AUGCCCAAGACCCAG 299

Qy 301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

, , MlhlllllllhlllMlllllllllhllll 

Db 300 AAGGAAGUACAUUUGAAGAACGCAAGUAGAGGGAGUGCAGGAAACAAGAACUACAG 355

Qy 361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419 

Ihhl llllillh MMMMMIMi 1 1 II llllllllllll Ih-IIM 

Db 356 GAUGUAGGAAGACCCUCCUGAGGAGUGAAGAGUGACAUGCCACCGCAGGAUCCUUUGCUC 415

Qy 420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477

Mill IhMIIM Mill -MM III lllllllhlll-Mh MM- 

Db 416 UGCACGAGUUACCUGUUAAACUUUGGAACACCUACC^^ 475 

Qy 478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 

MiM III -MIIIIIIMIIIMMMIIMIMh 

Db 476 AAAGAUGGGCGUUUCCCCCAAUGAAAUACACAAGUAAACAU 516



RESULT 5 

US-08-472-809B-8 

; Sequence 8, Application US/08472809B 

; Patent No. 5925564 

; GENERAL INFORMATION: 

APPLICANT: Schwartz, Robert J. 

APPLICANT: DeMayo, Franco J. 

APPLICANT: O'Malley, Bert W.

TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Lyon & Lyon 

STREET: 633 West Fifth Street 

STREET: Suite 4700 

CITY: Los Angeles 

STATE: California 

COUNTRY: U.S.A. 

ZIP: 90071-2066
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 

MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: IBM P.C. DOS 5.0 

SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/472,809B

FILING DATE: June 7, 1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846



FILING DATE: March 9, 1994 
APPLICATION NUMBER: 07/789,919 
FILING DATE: November 6, 1991
ATTORNEY/AGENT INFORMATION: 
NAME: Warburg, Richard J. 
REGISTRATION NUMBER: 32,327 
REFERENCE/DOCKET NUMBER: 214/212 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (213) 489-1600 
TELEFAX: (213) 955-0440
TELEX: 67-3510
INFORMATION FOR SEQ ID NO: 8:
SEQUENCE CHARACTERISTICS:
LENGTH: 5707 bases
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA
US-08-472-809B-8 

Query Match 55.4%; Score 286.4; DB 2; Length 5707; 

Best Local Similarity 85.6%; Pred. No. 2.3e-78; 

Matches 363; Conservative 0; Mismatches 11; Indels 50; Gaps 2; 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  793 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 852

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  853 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 912

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  913 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 972

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  973 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 1032

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db 1033 ATGCCCAAGACCCAG--------------------------------------------- 1047

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1048 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 1103

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| ||   |  |
Db 1104 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCCCCGGGC 1163

Qy  420 TGCA 423
        ||||
Db 1164 TGCA 1167



RESULT 6 

US-08-472-809B-7 

; Sequence 7, Application US/08472809B

; Patent No. 5925564

; GENERAL INFORMATION:

APPLICANT: Schwartz, Robert J. 

APPLICANT: DeMayo, Franco J. 

APPLICANT: O'Malley, Bert W. 

TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Lyon & Lyon 

STREET: 633 West Fifth Street 

STREET: Suite 4700 

CITY: Los Angeles 

STATE: California 

COUNTRY: U.S.A. 

ZIP: 90071-2066 
COMPUTER READABLE FORM:

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 

MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: IBM P.C. DOS 5.0 

SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/472,809B

FILING DATE: June 7, 1995

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846 

FILING DATE: March 9, 1994

APPLICATION NUMBER: 07/789,919

FILING DATE: November 6, 1991
ATTORNEY/AGENT INFORMATION: 

NAME: Warburg, Richard J. 

REGISTRATION NUMBER: 32,327 

REFERENCE/DOCKET NUMBER: 214/212 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (213) 489-1600 

TELEFAX: (213) 955-0440 

TELEX: 67-3510 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 6345 bases 

TYPE: nucleic acid 

STRANDEDNESS: double

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-472-809B-7 

Query Match 55.4%; Score 286.4; DB 2; Length 6345; 

Best Local Similarity 85.6%; Pred. No. 2.4e-78; 

Matches 363; Conservative 0; Mismatches 11; Indels 50; Gaps 2 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3702 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 3761

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3762 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 3821

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3822 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 3881

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3882 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 3941

Qy  241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
        |||||||||||||||
Db 3942 ATGCCCAAGACCCAG--------------------------------------------- 3956

Qy  301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
            ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 3957 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 4012

Qy  361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
        |||||| ||||||||| ||||||||||||||  |||| |||||||||||| ||   |  |
Db 4013 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCCCCGGGC 4072

Qy  420 TGCA 423
        ||||
Db 4073 TGCA 4076



RESULT 7 
5405942-13 
Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
JAMES P.

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS
I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
SEQ ID NO: 13: 

LENGTH: 357 
5405942-13 

Query Match 49.4%; Score 255.2; DB 6; Length 357; 

Best Local Similarity 98.8%; Pred. No. 2.8e-69; 
Matches 257; Conservative 0; Mismatches 3; Indels 0; Gaps 0;

Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 IE 1 1 1 M 1 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 43 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 102 



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

I IMM I M 1 1 1 1 1 1 1 1 1 1 1 M I IMM 1 1 1 1 M 1 1 1 II I III I 1 1 MM 1 1 1 1 1 II 

Db 103 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 162 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 163 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 222 

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

I llllll III III IMIIM IMIMIMM IMIMIMM IMMIMMMI 

Db 223 TGCGCACCCCTCAGGCCTGCCAAGTCAGCTGGCTCTGTCCGTGCCCAGCGCCACACCGAC 282 

Qy 241 ATGCCCAAGACCCAGAAGTA 260

I I i 1 1 1 1 1 1 1 Mill; I I 

Db 283 ATGCCCAAGACCCAGAAGGA 302



RESULT 8 
5405942-9 
Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
JAMES P.

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS
I AND II

NUMBER OF SEQUENCES: 16
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984
SEQ ID NO: 9: 

LENGTH: 357 
5405942-9 

Query Match 49.1%; Score 253.6; DB 6; Length 357; 

Best Local Similarity 79.2%; Pred. No. 8.8e-69; 

Matches 206; Conservative 50; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Db 43 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGACGCUCUUCAGUUCGUGUGUGGAGAC 102

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

-M I M 

Db 103 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCU^ 162 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Mill IMIMIIMIIMIM -M M 1 1 1 1 M M M Ml I M 1 1 1 M M I M M M 

Db 163 ACAGGUAUCGUGGAUGAGUGCUGUUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 222 

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

M II 1 1 1 1 1 1 M M 1 1 1 M 1 1 1 1 1 M 1 1 1 M M M M M M M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

Db 223 UGCGCACCCCUCAAGCCUGCCAAGUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 282

Qy 241 ATGCCCAAGACCCAGAAGTA 260 

IMIIM II I II II I I II I 



Db 283 AUGCCCAAGACCCAGAAGGA 302



RESULT 9 
5405942-7 

Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
JAMES P.

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
I AND II 

NUMBER OF SEQUENCES: 16
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
SEQ ID NO: 7: 

LENGTH: 210 
5405942-7 



Query Match 40.3%; Score 208.4; DB 6; Length 210;

Best Local Similarity 77.6%; Pred. No. 6e-55;

Matches 163; Conservative 46; Mismatches 1; Indels 0; Gaps 0;



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        |||||||||||||:|:|||||||:||||:||:|||:||:|::|||::||:|:|:||||||
Db    1 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||::::|:::||||||||||||||||:|:|||:|||||||:|||||||||||:|||
Db   61 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCUCCAGCAGUCGGAGGGCGCCUCAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| |:||:|||:|||:||:||::|||||||:|:||:|:||||||||:|||||:|:|:
Db  121 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 180

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 210
        :|||||||||:||||||:||||||:||||:
Db  181 UGCGCACCCCUCAAGCCUGCCAAGUCAGCU 210



RESULT 10 

5405942-11 

; Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
; JAMES P.

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS
; I AND II

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 11:

LENGTH: 210
5405942-11

Query Match 40.3%; Score 208.4; DB 6; Length 210;

Best Local Similarity 99.5%; Pred. No. 6e-55; 

Matches 209; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 210
        ||||||||||||||||||||||||||||||
Db  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 210



RESULT 11 
US-09-255-829-13 

Sequence 13, Application US/09255829 
Patent No. 6461617 
GENERAL INFORMATION: 



APPLICANT: Shone, Clifford Charles
APPLICANT: Quinn, Conrad Padraig
APPLICANT: Foster, Keith Alan
TITLE OF INVENTION: Recombinant Toxin Fragments
NUMBER OF SEQUENCES: 29
CORRESPONDENCE ADDRESS:

ADDRESSEE: STERNE, KESSLER, GOLDSTEIN, & FOX P.L.L.C.
STREET: 1100 NEW YORK AVENUE, NW, SUITE 600 
CITY: WASHINGTON 
STATE: DC 
COUNTRY: USA 
ZIP: 20005-3934 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/255,829

FILING DATE: 23-FEB-1999 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/GB97/02273 

FILING DATE: 22-AUG-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/782,893 

FILING DATE: 27-DEC-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: ESMOND, ROBERT W. 



REGISTRATION NUMBER: 32,893 
REFERENCE/DOCKET NUMBER: 1581.0130002 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-371-2600
TELEFAX: 202-371-2540 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2862 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: DNA (genomic)
FEATURE:

NAME/KEY: CDS
LOCATION: 1..2862
US-09-255-829-13 

Query Match 40.3%; Score 208.4; DB 4; Length 2862; 

Best Local Similarity 99.5%; Pred. No. 1.9e-54; 

Matches 209; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2644 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 2703

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2704 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 2763

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2764 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 2823

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 210
        ||||||||||||||||||||||||||||||
Db 2824 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 2853



RESULT 12 

5405942-15 

; Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B.; MERRYWEATHER,
; JAMES P.

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS
; I AND II

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 15:

LENGTH: 210
5405942-15 



Query Match 40.0%; Score 206.8; DB 6; Length 210;

Best Local Similarity 77.1%; Pred. No. 1.9e-54;

Matches 162; Conservative 46; Mismatches 2; Indels 0; Gaps 0;



Qy    1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
        |||||||||||||:|:|||||||:||||:||:|||:||:|::|||::||:|:|:||||||
Db    1 GGACCGGAGACGCUCUGCGGGGCUGAGCUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 60

Qy   61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
        ||||||::::|:::||||||||||||||||:|:|||:|||||||:|||||||||||:|||
Db   61 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGCUCCAGCAGUCGGAGGGCGCCUCAG 120

Qy  121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
        ||||| |:||:|||:|||:||:||::|||||||:|:||:|:||||||||:|||||:|:|:
Db  121 ACAGGUAUCGUGGAUGAGUGCUGCUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 180

Qy  181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCT 210
        :|||||||||:|| |||:||||||:||||:
Db  181 UGCGCACCCCUCAGGCCUGCCAAGUCAGCU 210



RESULT 13 
US-08-308-196A-1 

Sequence 1, Application US/08308196A 
Patent No. 5612198 
GENERAL INFORMATION: 

APPLICANT: Brierley, Russell A. 
APPLICANT: Davis, Geneva R. 
APPLICANT: Holtz, Gregory C. 
APPLICANT: Gleeson, Martin A. 
APPLICANT: Howard, Bradley D. 

TITLE OF INVENTION: Production of Insulin-Like Growth 
TITLE OF INVENTION: Factor-1 in Methylotrophic Yeast Cells
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Brown, Martin, Haller & McClain 
STREET: 1660 Union Street 
CITY: San Diego 
STATE: California 
COUNTRY: USA 
ZIP: 92101-2926 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk
COMPUTER: IBM PC compatible
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/308,196A
FILING DATE: 09-SEPT-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/983,523 
FILING DATE: 03-MAR-1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/578,728 
FILING DATE: 04-SEP-1990 
ATTORNEY/AGENT INFORMATION: 
NAME: Seidman, Stephanie L. 
REGISTRATION NUMBER: 33,779 



REFERENCE/DOCKET NUMBER: 51875 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (619)238-0999

TELEFAX: (619)238-0062 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 240 base pairs

TYPE: nucleic acid 

STRANDEDNESS: double

TOPOLOGY: unknown 
MOLECULE TYPE: cDNA 
FEATURE:

NAME/KEY: CDS

LOCATION: 14..232
US-08-308-196A-1 

Query Match 39.2%; Score 202.8; DB 1; Length 240; 

Best Local Similarity 96.7%; Pred. No. 3.4e-53; 

Matches 207; Conservative 0; Mismatches 7; Indels 0; Gaps 0 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||| ||||||||||| ||||||||||||||||||
Db    17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         |||||||||||||||||||||||||||||||||||||||||||||||  |||||||||||
Db    77 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGACGGGCGCCTCAG 136

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db   137 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTCGAGATGTAT 196

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCT 214
         |||||||||||||||||||||||||||||| | |
Db   197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 230



RESULT 14 
PCT-US91-06452-1 

Sequence 1, Application PC/TUS9106452 
GENERAL INFORMATION: 

APPLICANT: Brierley, Russell A. 
APPLICANT: Davis, Geneva R. 
APPLICANT: Holtz, Gregory C. 
APPLICANT: Gleeson, Martin A. 
APPLICANT: Bradley, D. H.
TITLE OF INVENTION: Production of Insulin-Like Growth
TITLE OF INVENTION: Factor-1 in Methylotrophic Yeast Cells
NUMBER OF SEQUENCES: 12

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fitch, Even, Tabin & Flannery 

STREET: 135 South LaSalle Street, Suite 900 

CITY: Chicago 

STATE: Illinois 

COUNTRY: USA 

ZIP: 60603 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US91/06452
FILING DATE: 19910409 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/578,728
FILING DATE: 04-SEP-1990
ATTORNEY/AGENT INFORMATION:
NAME: Seidman, Stephanie L. 
REGISTRATION NUMBER: 33,779 
REFERENCE/DOCKET NUMBER: 51874
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (619)552-1311 
TELEFAX: (619)552-0095 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 240 base pairs
TYPE: NUCLEIC ACID
STRANDEDNESS: double
TOPOLOGY: unknown
MOLECULE TYPE: cDNA
FEATURE:

NAME/KEY: CDS
LOCATION: 14..232
PCT-US91-06452-1 

Query Match 39.2%; Score 202.8; DB 5; Length 240; 

Best Local Similarity 96.7%; Pred. No. 3.4e-53; 

Matches 207; Conservative 0; Mismatches 7; Indels 0; Gaps 0 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||| ||||||||||| ||||||||||||||||||
Db    17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         |||||||||||||||||||||||||||||||||||||||||||||||  |||||||||||
Db    77 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGACGGGCGCCTCAG 136

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db   137 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTCGAGATGTAT 196

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCT 214
         |||||||||||||||||||||||||||||| | |
Db   197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 230



RESULT 15 
US-09-029-267-13 

; Sequence 13, Application US/09029267 
; Patent No. 6107057 
; GENERAL INFORMATION:

APPLICANT: Crawford, Kenneth 



APPLICANT: Zaror, Isabel 
APPLICANT: Innis, Michael 

TITLE OF INVENTION: Pichia Secretory Leader for Protein 
TITLE OF INVENTION: Expression 
NUMBER OF SEQUENCES: 40
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Chiron Corporation 

STREET: 4560 Horton Street 

CITY: Emeryville 

STATE: California 

COUNTRY: United States 

ZIP: 94608 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/029,267

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION:

NAME: Chung, Ling-Fong 

REGISTRATION NUMBER: 36,482 

REFERENCE/DOCKET NUMBER: 1165.100 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (510) 601-2704 

TELEFAX: (510) 655-3542 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 390 base pairs

TYPE: nucleic acid 

STRANDEDNESS: single

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "Synthetic" 
US-09-029-267-13 

Query Match 39.2%; Score 202.8; DB 3; Length 390;

Best Local Similarity 96.7%; Pred. No. 4.2e-53; 

Matches 207; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db   160

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db   220

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db   280

Qy   181
Db   340



Search completed: December 13, 2003, 11:44:49
Job time : 49.8037 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd.



OM nucleic - nucleic search, using sw model

Run on:         December 13, 2003, 07:29:55 ; Search time 230.833 Seconds
                (without alignments)
                7443.919 Million cell updates/sec

Title:          US-09-852-261-1
Perfect score:  517
Sequence:       1 ggaccggagacgctctgcgg tgaaatacacaagtaaacat 517

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       2201672 seqs, 1661799599 residues

Total number of hits satisfying chosen parameters:      4403344

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      Published_Applications_NA:*
                1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9:  /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
                10:  /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:
                11:  /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:
                12:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
                13:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq2
                14:  /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq
                15:  /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq
                16:  /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
                17:  /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
                18:  /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
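
The per-result statistics in this report (Score, Query Match, Best Local Similarity) can be cross-checked against the listed Matches, Mismatches, Indels and Gaps counts. The Python sketch below is only an illustration under stated assumptions: the report gives Gapop 10.0, Gapext 1.0 and a perfect score of 517, but the match value of 1.0, the mismatch value of -0.6 and the gap cost of "10 per gap plus 1 per gap position" are inferred here from the reported numbers, not documented by the search software. The names `AlignmentCounts`, `score`, `query_match` and `best_local_similarity` are hypothetical. Pred. No. is not reproduced, since its derivation is not given.

```python
from dataclasses import dataclass

# Values read from the report header.
PERFECT_SCORE = 517.0   # "Perfect score: 517"
GAP_OPEN = 10.0         # "Gapop 10.0"
GAP_EXTEND = 1.0        # "Gapext 1.0"

# Assumptions inferred from the listed results, not stated in the report.
MATCH = 1.0             # a 517-base self-match scores 517
MISMATCH = -0.6         # 207 matches with 7 mismatches is reported as 202.8


@dataclass(frozen=True)
class AlignmentCounts:
    matches: int
    mismatches: int
    indels: int   # total number of gap positions
    gaps: int     # number of separate gaps


def score(c: AlignmentCounts) -> float:
    """Alignment score under the assumed scoring scheme."""
    gap_cost = GAP_OPEN * c.gaps + GAP_EXTEND * c.indels
    return MATCH * c.matches + MISMATCH * c.mismatches - gap_cost


def query_match(c: AlignmentCounts) -> float:
    """Score as a percentage of the perfect (self-match) score."""
    return 100.0 * score(c) / PERFECT_SCORE


def best_local_similarity(c: AlignmentCounts) -> float:
    """Matches as a percentage of all alignment columns."""
    columns = c.matches + c.mismatches + c.indels
    return 100.0 * c.matches / columns


if __name__ == "__main__":
    # Counts copied from three results listed in this report.
    examples = {
        "US-08-308-196A-1": AlignmentCounts(207, 7, 0, 0),
        "US-09-852-261-5": AlignmentCounts(501, 16, 4, 2),
        "US-09-919-497-24": AlignmentCounts(455, 13, 53, 5),
    }
    for name, counts in examples.items():
        print(name,
              round(score(counts), 1),
              round(query_match(counts), 1),
              round(best_local_similarity(counts), 1))
```

Running the script prints the recomputed Score, Query Match and Best Local Similarity for each example so they can be compared with the values in the corresponding result blocks.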
ALIGNMENTS



RESULT 1 
US-09-852-261-1 

; Sequence 1, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 



APPLICANT: TERENGHI, GIORGIO
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE
FILE REFERENCE: 117-351

CURRENT APPLICATION NUMBER: US/09/852,261
CURRENT FILING DATE: 2001-05-10
PRIOR APPLICATION NUMBER: GB 0011278.9
PRIOR FILING DATE: 2000-05-10
NUMBER OF SEQ ID NOS: 14
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 1
LENGTH: 517
TYPE: DNA

ORGANISM: Homo sapiens 
US-09-852-261-1 

Query Match 100.0%; Score 517; DB 9; Length 517; 

Best Local Similarity 100.0%; Pred. No. 2.4e-160; 

Matches 517; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

Qy   361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   361 GATGTAGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCT 420

Qy   421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   421 GCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAA 480

Qy   481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||||||||||||||||||||||||||||||||||
Db   481 GATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517



RESULT 2 



US-09-852-261-5 

; Sequence 5, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI, GIORGIO

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/09/852,261

; CURRENT FILING DATE: 2001-05-10 

; PRIOR APPLICATION NUMBER: GB 0011278.9 

; PRIOR FILING DATE: 2000-05-10 

; NUMBER OF SEQ ID NOS: 14

SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 5 

LENGTH: 523 

TYPE: DNA 

ORGANISM: Oryctolagus cuniculus 
US-09-852-261-5 

Query Match 90.4%; Score 467.4; DB 9; Length 523; 

Best Local Similarity 96.2%; Pred. No. 6.1e-144; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
Db

Qy   241
Db

Qy   298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357
Db

Qy   358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416
Db

Qy   417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476
Db

Qy   477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||||||||||||||||||||||||||||||||||||||
Db   481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521



RESULT 3 

US-09-852-261-13 

Sequence 13, Application US/09852261 
Patent No. US20020083477A1 
GENERAL INFORMATION: 
APPLICANT: GOLDSPINK, GEOFFREY 
APPLICANT: TERENGHI, GIORGIO
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 
FILE REFERENCE: 117-351 

CURRENT APPLICATION NUMBER: US/09/852,261 
CURRENT FILING DATE: 2001-05-10
PRIOR APPLICATION NUMBER: GB 0011278.9 
PRIOR FILING DATE: 2000-05-10 
NUMBER OF SEQ ID NOS: 14
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 13
LENGTH: 471
TYPE: DNA

ORGANISM: Oryctolagus cuniculus 
US-09-852-261-13 

Query Match 73.0%; Score 377.2; DB 9; Length 471; 

Best Local Similarity 87.8%; Pred. No. 4e-114; 

Matches 455; Conservative 0; Mismatches 13; Indels 50; Gaps 2; 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Db     1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

llllilllllllllllllllllllMIII II Illlllllllllllllllll MINI 

Db 61 AGGGGCTTTTATTTC7yV(^GCCC^CAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         ||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

II llllll lllllll I 1 1 III 1 1 1 1 Mill MINIM! llllll Mill 

Db 181 TGTGCACCCCTCAAGCCGGO^GGC^GCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         ||||||||||| |||
Db   241 ATGCCCAAGACTCAG--------------------------------------------- 255

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   256 ----AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 311

Qy   361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
         |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   312 GATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 371

Qy   420 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 479
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   372 TGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAA 431

Qy   480 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         ||||||||||||||||||||||||||||||||||||||
Db   432 AGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 469



RESULT 4 

US-09-919-497-24 

Sequence 24, Application US/09919497 
Patent No. US20020106662A1
GENERAL INFORMATION: 
APPLICANT: Mutter, George L. 

TITLE OF INVENTION: PROGNOSTIC CLASSIFICATION OF ENDOMETRIAL CANCER 
FILE REFERENCE: B0801/7225 

CURRENT APPLICATION NUMBER: US/09/919,497
CURRENT FILING DATE: 2001-07-31 
PRIOR APPLICATION NUMBER: US 60/221,735 
PRIOR FILING DATE: 2000-07-31 
NUMBER OF SEQ ID NOS: 100
SOFTWARE: PatentIn version 3.0
SEQ ID NO 24
LENGTH: 7260
TYPE: DNA

ORGANISM: Homo sapiens 
US-09-919-497-24 

Query Match 66.6%; Score 344.2; DB 10; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.3e-102;

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5; 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         |||||||||||||||
Db   551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
             ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy   361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
         |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db   622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy   420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
         ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db   682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy   478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||| ||| ||||||||||||||||||||||||||||||
Db   742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 5 

US-09-880-107-3739 

; Sequence 3739, Application US/09880107 

; Patent No. US20020142981A1 

; GENERAL INFORMATION: 

; APPLICANT: Horne, Darci T.

; APPLICANT: Vockley, Joseph G. 

; APPLICANT: Scherf, Uwe 

; APPLICANT: Gene Logic, Inc. 

; TITLE OF INVENTION: Gene Expression Profiles in Liver Cancer 
; FILE REFERENCE: 44921-5028-WO

CURRENT APPLICATION NUMBER: US/09/880,107
; CURRENT FILING DATE: 2001-06-14 
; PRIOR APPLICATION NUMBER: US 60/211,379 
; PRIOR FILING DATE: 2000-06-14 

PRIOR APPLICATION NUMBER: US 60/237,054 

PRIOR FILING DATE: 2000-10-02 
; NUMBER OF SEQ ID NOS: 3950
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 3739
LENGTH: 7260
TYPE: DNA
; ORGANISM: Homo sapiens
FEATURE:

OTHER INFORMATION: Genbank Accession No. US20020142981A1 X57025 
US-09-880-107-3739 

Query Match 66.6%; Score 344.2; DB 10; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.3e-102; 

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         |||||||||||||||
Db   551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
             ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy   361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
         |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db   622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy   420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
         ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db   682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy   478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||| ||| ||||||||||||||||||||||||||||||
Db   742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 6 

US-09-873-319-707 

; Sequence 707, Application US/09873319A 

; Publication No. US20030134324A1

; GENERAL INFORMATION: 

; APPLICANT: Munger, William E. 

; APPLICANT: Kulkarni, Prakash

; APPLICANT: Getzenberg, Robert H. 

; APPLICANT: Waga, Iwao

; APPLICANT: Yamamoto, Jun 

TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic 
; TITLE OF INVENTION: Hyperplasia Using Gene Expression Profiles 

FILE REFERENCE: 44921-5029-US

CURRENT APPLICATION NUMBER: US/09/873,319A
; CURRENT FILING DATE: 2001-06-05

EARLIER APPLICATION NUMBER: US 60/223,323 
; EARLIER FILING DATE: 2000-08-07 
; NUMBER OF SEQ ID NOS: 755

SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 707
LENGTH: 7260
TYPE: DNA

ORGANISM: Homo sapiens
FEATURE:

OTHER INFORMATION: Genbank Accession No. US20030134324A1 X57025
US-09-873-319-707

Query Match 66.6%; Score 344.2; DB 13; Length 7260;

Best Local Similarity 87.3%; Pred. No. 1.3e-102; 

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5; 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         |||||||||||||||
Db   551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
             ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy   361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
         |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db   622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy   420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
         ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db   682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy   478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||| ||| ||||||||||||||||||||||||||||||
Db   742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782





RESULT 7 

US-09-960-706-1066 

; Sequence 1066, Application US/09960706 

; Publication No. US20030134280A1 

; GENERAL INFORMATION: 

; APPLICANT: Munger, William E.

TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic Hyperplasia Using
; TITLE OF INVENTION: Gene Expression Profiles
; FILE REFERENCE: 44921-5029-01US
; CURRENT APPLICATION NUMBER: US/09/960,706
; CURRENT FILING DATE: 2001-09-24 

PRIOR APPLICATION NUMBER: 60/223,323

PRIOR FILING DATE: 2000-08-07 
; PRIOR APPLICATION NUMBER: 09/873,319 
; PRIOR FILING DATE: 2001-06-05 
; NUMBER OF SEQ ID NOS: 1124

SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 1066
LENGTH: 7260
TYPE: DNA 

ORGANISM: Homo sapiens 



FEATURE : 

OTHER INFORMATION: Genbank Accession No. US20030134280A1 X57025 
US-09-960-706-1066 

Query Match 66.6%; Score 344.2; DB 13; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.3e-102; 



Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         |||||||||||||||
Db   551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
             ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy   361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
         |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db   622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy   420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
         ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db   682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy   478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||| ||| ||||||||||||||||||||||||||||||
Db   742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782





RESULT 8
US-10-136-639-4 

; Sequence 4, Application US/10136639 
; Publication No. US20030072761A1
; GENERAL INFORMATION:
; APPLICANT: LeBowitz, Jonathan 

; TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TARGETING PROTEINS ACROSS THE BLOOD BRAIN
; TITLE OF INVENTION: BARRIER

FILE REFERENCE: SYM-008 
; CURRENT APPLICATION NUMBER: US/10/136,639
; CURRENT FILING DATE: 2002-09-06 



PRIOR APPLICATION NUMBER: US 60/329,650 
PRIOR FILING DATE: 2001-10-16
NUMBER OF SEQ ID NOS: 4
SOFTWARE: PatentIn version 3.0
SEQ ID NO 4 
LENGTH: 7260 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-136-639-4 

Query Match 66.6%; Score 344.2; DB 15; Length 7260; 

Best Local Similarity 87.3%; Pred. No. 1.3e-102; 

Matches 455; Conservative 0; Mismatches 13; Indels 53; Gaps 5; 
Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 490

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   491 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         |||||||||||||||
Db   551 ATGCCCAAGACCCAG--------------------------------------------- 565

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
             ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   566 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 621

Qy   361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
         |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db   622 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 681

Qy   420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
         ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db   682 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 741

Qy   478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||| ||| ||||||||||||||||||||||||||||||
Db   742 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 782



RESULT 9 

US-10-207-655-54 

Sequence 54, Application US/10207655 
Publication No. US20030118592A1 
GENERAL INFORMATION: 
APPLICANT: Ledbetter, Jeffrey A. 



; APPLICANT: Hayden-Ledbetter, Martha S.

; TITLE OF INVENTION: BINDING DOMAIN-IMMUNOGLOBULIN FUSION PROTEINS

; FILE REFERENCE: 390069.401C1

; CURRENT APPLICATION NUMBER: US/10/207,655

; CURRENT FILING DATE: 2002-07-25 

; NUMBER OF SEQ ID NOS: 426

SOFTWARE: PatentIn version 3.0
; SEQ ID NO 54

LENGTH: 725

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-207-655-54 



Query Match 66.3%; Score 342.6; DB 15; Length 725; 

Best Local Similarity 87.1%; Pred. No. 1.4e-102; 

Matches 454; Conservative 0; Mismatches 14; Indels 53; Gaps 5 



Qy     1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60
Db   156                                                              215

Qy    61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120
Db   216                                                              275

Qy   121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180
Db   276                                                              335

Qy   181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   336 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 395

Qy   241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300
         |||||||||||||||
Db   396 ATGCCCAAGACCCAG--------------------------------------------- 410

Qy   301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360
             ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db   411 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 466

Qy   361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419
         |||||| ||||||||| ||||||||||||||  |||| |||||||||||| |||||||||
Db   467 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTC 526

Qy   420 TGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTC 477
         ||||| ||||||||| ||||| |||||| |||  |||||||||||||||||| ||||||
Db   527 TGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTA 586

Qy   478 AAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517
         |||||| ||| ||||||||||||||||||||||||||||||
Db   587 AAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACAT 627




RESULT 10
US-09-852-261-3
; Sequence 3, Application US/09852261
Patent No. US20020083477A1
GENERAL INFORMATION: 
APPLICANT: GOLDSPINK, GEOFFREY 
APPLICANT: TERENGHI, GIORGIO
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 
FILE REFERENCE: 117-351 

CURRENT APPLICATION NUMBER: US/09/852,261 
CURRENT FILING DATE: 2001-05-10 
PRIOR APPLICATION NUMBER: GB 0011278.9 
PRIOR FILING DATE: 2000-05-10 
NUMBER OF SEQ ID NOS: 14
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 3 
LENGTH: 539 
TYPE: DNA 

ORGANISM: Rattus sp.
US-09-852-261-3 

Query Match 62.9%; Score 325.2; DB 9; Length 539; 

Best Local Similarity 81.2%; Pred. No. 6.8e-97; 

Matches 429; Conservative 0; Mismatches 88; Indels 11; Gaps 4; 
Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

     ||||| ||||| || |||||||||||||||||||| |||||||||||||||||||||

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

II IMMIIII III llllllllll Ml MM II II I Ml I IMMIIII II III 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGG 120

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

       || ||||| ||||||||||| |||||||||||||||||||| |||||||||||||||||

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

II I I I lllllll I Ml I Ml MM II MM IIIIIM III I III III 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCAT^ 240

Qy 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCA^ G 297

lllllllllll lllllll IMIMI III II IIIIIM I MM II I 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCT^ 300

Qy 298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357

Illlllllllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M M 1 1 1 III 

Db 301 AGAAGGAAAGGAAGTACACTTGAAGAAC^C^GTAGAGGAAGTGCAGGAAAC^GACCTA 360

Qy 358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416

       ||| ||||| || || ||| | |||||    | || | || | ||||||| || ||||||

Db 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420

Qy 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 470

M I llllll MM III MM II M Mill II 

Db 421 CTGCTTGAGCAACCTGCAAAAOITCGGAACACCT^ 480

Qy 471 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 

lllllll MM IMIMIMI IMMMIIMMIMMIMM 

Db 481 TC^TTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACAT 528



RESULT 11 
US-10-161-088-1 

; Sequence 1, Application US/10161088
; Publication No. US20030077761A1
; GENERAL INFORMATION:
; APPLICANT: Parrow, Vendela
; APPLICANT: Rosengren, Linda
; TITLE OF INVENTION: NEW METHODS
; FILE REFERENCE: 13425-111001
; CURRENT APPLICATION NUMBER: US/10/161,088
; CURRENT FILING DATE: 2002-05-31
; PRIOR APPLICATION NUMBER: SE 0101934-8
; PRIOR FILING DATE: 2001-06-01
; NUMBER OF SEQ ID NOS: 3
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 1
; LENGTH: 651
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (73)...(471)
US-10-161-088-1 

Query Match 61.5%; Score 318.2; DB 15; Length 651; 

Best Local Similarity 81.7%; Pred. No. 1.5e-94; 

Matches 419; Conservative 0; Mismatches 83; Indels 11; Gaps 4;
Qy   1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

       ||||| ||||| || ||||||||||||||||||||||||||||||||||||||||||

Db 139 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120



Db 




Qy 




Db 



Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240



Db 




Qy 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA---G 297



Db 




Qy 298 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357



Db 




Qy 358 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416



Db 




Qy 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATC 470

II I II III I I II III ML II lllllll II 

Db 559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAAT 618

Qy 471 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAA 502

       ||||| ||||||| |||||||||| ||||||||

Db 619 ACATTACAAAGATGGGCATTTCCCGCAATGAAA 651



RESULT 12 
US-10-251-661-7 

; Sequence 7, Application US/10251661
; Publication No. US20030166555A1
; GENERAL INFORMATION:
; APPLICANT: Alberini, Cristina M.
; APPLICANT: Bear, Mark F.
; TITLE OF INVENTION: Methods and Compositions for Regulating
; TITLE OF INVENTION: Memory Consolidation
; FILE REFERENCE: 3499.1001-003
; CURRENT APPLICATION NUMBER: US/10/251,661
; CURRENT FILING DATE: 2002-09-20
; PRIOR APPLICATION NUMBER: 60/193,614
; PRIOR FILING DATE: 2000-03-31
; PRIOR APPLICATION NUMBER: PCT/US01/10661
; PRIOR FILING DATE: 2001-04-02
; NUMBER OF SEQ ID NOS: 12
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 7
; LENGTH: 612
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (151)...(564)
US-10-251-661-7 



Query Match 55.2%; Score 285.4; DB 13; Length 612; 

Best Local Similarity 86.5%; Pred. No. l.le-83; 

Matches 359; Conservative 0; Mismatches 6; Indels 50; Gaps 2;
Qy   1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 247 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 306

Qy  61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 307 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 366



Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 367 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 426

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 427 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 486



Qy 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGA 300

       |||||||||||||||

Db 487 ATGCCCAAGACCCAG--------------------------------------------- 501

Qy 301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360

           ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||

Db 502 ----AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 557

Qy 361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTT 414

       |||||| ||||||||| ||||||||||||||  |||| |||||||||||| ||||

Db 558 GATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTT 612



RESULT 13 
US-09-852-261-9 

; Sequence 9, Application US/09852261
; Patent No. US20020083477A1
; GENERAL INFORMATION:
; APPLICANT: GOLDSPINK, GEOFFREY
; APPLICANT: TERENGHI, GIORGIO
; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE
; FILE REFERENCE: 117-351
; CURRENT APPLICATION NUMBER: US/09/852,261
; CURRENT FILING DATE: 2001-05-10
; PRIOR APPLICATION NUMBER: GB 0011278.9
; PRIOR FILING DATE: 2000-05-10
; NUMBER OF SEQ ID NOS: 14
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 9
; LENGTH: 318
; TYPE: DNA
; ORGANISM: Homo sapiens
US-09-852-261-9 



Query Match 50.0%; Score 258.4; DB 9; Length 318; 

Best Local Similarity 99.6%; Pred. No. 6.6e-75;

Matches 259; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db   1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

Qy  61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db  61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

Qy 241 ATGCCCAAGACCCAGAAGTA 260

       |||||||||||||||||| |

Db 241 ATGCCCAAGACCCAGAAGGA 260
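
Note: the match row between each Qy and Db line marks identical columns with a bar. A minimal sketch of how such a row, and the Matches/Mismatches/Indels tallies, could be rebuilt from two already-aligned rows follows. The function names `match_line` and `tally` are illustrative assumptions, not part of the search software, and the sketch assumes `-` is the gap character.

```python
def match_line(qy, db, gap="-"):
    """Build the '|' row for two already-aligned rows of equal width."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have the same width")
    return "".join(
        "|" if a == b and a != gap else " "
        for a, b in zip(qy.upper(), db.upper())
    )


def tally(qy, db, gap="-"):
    """Count (matches, mismatches, indels) column by column."""
    matches = mismatches = indels = 0
    for a, b in zip(qy.upper(), db.upper()):
        if a == gap or b == gap:
            indels += 1
        elif a == b:
            matches += 1
        else:
            mismatches += 1
    return matches, mismatches, indels


# Final row of RESULT 13 (Qy 241-260 against Db 241-260).
qy = "ATGCCCAAGACCCAGAAGTA"
db = "ATGCCCAAGACCCAGAAGGA"
print(match_line(qy, db))
print(tally(qy, db))
```

For this row the only differing column is the nineteenth (T in the query, G in the database sequence), which agrees with the single mismatch reported for RESULT 13.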



RESULT 14 
US-09-852-261-11 

; Sequence 11, Application US/09852261
; Patent No. US20020083477A1
; GENERAL INFORMATION:
; APPLICANT: GOLDSPINK, GEOFFREY
; APPLICANT: TERENGHI, GIORGIO
; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE
; FILE REFERENCE: 117-351
; CURRENT APPLICATION NUMBER: US/09/852,261
; CURRENT FILING DATE: 2001-05-10
; PRIOR APPLICATION NUMBER: GB 0011278.9
; PRIOR FILING DATE: 2000-05-10
; NUMBER OF SEQ ID NOS: 14
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 11
; LENGTH: 487
; TYPE: DNA
; ORGANISM: Rattus sp.
US-09-852-261-11 

Query Match 47.9%; Score 247.8; DB 9; Length 487; 

Best Local Similarity 74.5%; Pred. No. 2.6e-71; 

Matches 391; Conservative 0; Mismatches 77; Indels 57; Gaps 4;
Qy 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

     ||||| ||||| || |||||||||||||||||||| |||||||||||||||||||||

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120



Db 




Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180



Db 




Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240



Db 




Qy 




Db 



Qy 301 AGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAG 360



Db 




Qy 361 GATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTC 419



Db 




Qy 420 TGCACAGTTACCTGTAAACATTGGAATACCGGCCA------AAAAATAAGTTTGATCACA 473



lllllllll III 1 1 1 1 



II III MM II II 



Db 372 CTTGAGCAACCTGOVAAACATCGGAACACCTGCCAAATAT 431 

Qy 474 TTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACAT 517

       ||||| |||| |||||||||| |||||||||||||||||||||||

Db 432 TTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACAT 476



RESULT 15 
US-10-238-114-1 

; Sequence 1, Application US/10238114
; Publication No. US20030100073A1
; GENERAL INFORMATION:
; APPLICANT: Merial
; APPLICANT: ANDREONI, Christine Michele
; TITLE OF INVENTION: IGF-1 AS FELINE VACCINE ADJUVANT, IN PARTICULAR AGAINST
; TITLE OF INVENTION: FELINE RETROVIRUS
; FILE REFERENCE: 454313-3165.1
; CURRENT APPLICATION NUMBER: US/10/238,114
; CURRENT FILING DATE: 2002-09-10
; PRIOR APPLICATION NUMBER: FR 01 11736
; PRIOR FILING DATE: 2001-09-11
; PRIOR APPLICATION NUMBER: US 60/318,666
; PRIOR FILING DATE: 2001-09-12
; NUMBER OF SEQ ID NOS: 20
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 1
; LENGTH: 462
; TYPE: DNA
; ORGANISM: Felis catus
US-10-238-114-1 



Query Match 44.1%; Score 228; DB 15; Length 462; 

Best Local Similarity 92.3%; Pred. No. 9e-65; 

Matches 240; Conservative 0; Mismatches 20; Indels 0; Gaps 0;



Qy   1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60

       ||||| ||||||||||| ||||||||| ||||||| ||||||||||||||||||||||||

Db 145 GGACCAGAGACGCTCTGTGGGGCTGAGTTGGTGGACGCTCTTCAGTTCGTGTGTGGAGAC 204

Qy  61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120

       ||||| |||||||||||||||||||| |||||||||||||||||||||||||| ||||||

Db 205 AGGGGTTTTTATTTCAACAAGCCCACGGGGTATGGCTCCAGCAGTCGGAGGGCACCTCAG 264

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180

       ||||||||||||||||||||||||||||||||||||||||| ||| |||| ||||||||

Db 265 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGCGGCTAGAGATGTAC 324

Qy 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240

       || ||||||||||||||||||||||| || ||||| |||||||| ||||||||||| |||

Db 325 TGTGCACCCCTCAAGCCTGCCAAGTCTGCCCGCTCAGTCCGTGCTCAGCGCCACACTGAC 384

Qy 241 ATGCCCAAGACCCAGAAGTA 260

       ||||||||| | |||||| |

Db 385 ATGCCCAAGGCTCAGAAGGA 404





Search completed: December 13, 2003, 11:56:45 



Job time: 232.833 secs
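
Note: the two percentages printed for each result appear to follow directly from the raw figures on the same lines. The sketch below recomputes them under two assumptions that are not stated in the listing itself: that Query Match is the score divided by a perfect score of 517 (the query length given in the report header), and that Best Local Similarity is Matches divided by Matches + Mismatches + Indels. The names `PERFECT_SCORE`, `STATS` and `check_stats` are hypothetical.

```python
import re

# Assumption: the perfect score equals the query length (517) shown in the
# report header; it is not repeated in the per-result statistics.
PERFECT_SCORE = 517.0

# Pulls the figures out of one three-line statistics block.
STATS = re.compile(
    r"Query Match\s+(?P<qm>[\d.]+)%;\s+Score\s+(?P<score>[\d.]+);.*?"
    r"Best Local Similarity\s+(?P<bls>[\d.]+)%;.*?"
    r"Matches\s+(?P<m>\d+);.*?Mismatches\s+(?P<mm>\d+);\s+Indels\s+(?P<ind>\d+)",
    re.S,
)


def check_stats(block, tol=0.06):
    """Recompute both percentages and compare them with the printed ones."""
    g = STATS.search(block)
    if g is None:
        raise ValueError("no statistics block found")
    m, mm, ind = int(g["m"]), int(g["mm"]), int(g["ind"])
    query_match = 100.0 * float(g["score"]) / PERFECT_SCORE
    similarity = 100.0 * m / (m + mm + ind)
    consistent = (
        abs(query_match - float(g["qm"])) <= tol
        and abs(similarity - float(g["bls"])) <= tol
    )
    return {
        "query_match": round(query_match, 2),
        "similarity": round(similarity, 2),
        "consistent": consistent,
    }


result_10 = """
Query Match 62.9%; Score 325.2; DB 9; Length 539;
Best Local Similarity 81.2%; Pred. No. 6.8e-97;
Matches 429; Conservative 0; Mismatches 88; Indels 11; Gaps 4;
"""
print(check_stats(result_10))
```

The tolerance of 0.06 percentage points allows for the listing printing one decimal place, apparently without rounding up in every case (81.25 is shown as 81.2 for RESULT 10).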



